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RECEPTOR 

The present invention relates to a receptor. In particular the present invention relates 
to the use of a receptor to screen agents to assess their suitability for subsequent use 
5 as pharmaceutical agents, such as therapeutic agents and diagnostic agents. 

Receptors are structures that bind chemical stimuli specifically and directly or 
indirectly transduce a message into the intracellular environment (Watson et al 1992 
Recombinant DNA Second Edition, Chapter 17, published by Scientific American 
1 0 Books). Some receptors, otherwise known as G-protein coupled receptors (GCRs), are 
coupled to second messenger systems via GTP-binding proteins, otherwise known as 
G-proteins. The G-proteins connect hormone receptors to adenylate cyclase or other 
signalling enzymes. 

15 In more detail, the GCRs represent the largest receptor protein class in vertebrates. 
Typically they are seven-pass transmembrane receptors (see Figure 1). In particular, 
the GCRs have been shown to be involved in the regulation of a large variety of 
physiological processes, with many of the genes encoding them mutated in genetic 
disorders and mutants. 

20 

GCRs, can be divided into six families on the basis of their amino acid sequence 
homologies (similarities) which span across the transmembrane containing region. In 
this regard, the B family, which is the second largest family, contains the receptors for 
pituitary adenylate cyclase activating polypeptide, vasoactive intestinal polypeptide 
25 (VIP), secretin, growth hormone releasing hormone, diuretic hormone, glucagon, 
giucagon-like peptide, calcitonin and gastric inhibitory polypeptide. 

GCRs can even be placed into functional categories. In this regard, several different 
types of G-protein coupling have been identified. 
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For example, the GCR will either interact with an ion channel which is itself a 
seven-pass transmembrane protein or it can interact with adenylate cyclase, 
phospholipase C or phospholipase A2, all of which signal to secondary messengers. 

5 The G-protein can either be stimulatory (Gs) or inhibitory (Gi) and can therefore 
stimulate or inhibit the action of the ion channel or second messenger pathway they 
are effecting. 

All the B family GCRs have been shown to couple to adenylate cyclase via a 
10 stimulatory G-protein. 

In this regard, the present invention provides a new receptor obtainable from animals. 
The present invention also provides a new use of that receptor. 

15 Thus, according to a first aspect of the present invention there is provided a method 
of screening a chemical for subsequent use as a pharmaceutical agent, the method 
comprising contacting the chemical with a receptor; and determining whether the 
chemical interacts with the receptor to form a chemical-receptor complex; wherein the 
receptor comprises EGF-like repeats. 

20 

According to a second aspect of the present invention there is provided a method of 
screening a chemical for subsequent use as a pharmaceutical agent, the method 
comprising contacting the chemical with a receptor, and deterrnining whether the 
chemical interacts with the receptor to form a chemical-receptor complex; wherein the 
25 receptor comprises cadherin-like repeats. 

According to a third aspect of the present invention there is provided a method of 
screening a chemical for subsequent use as a pharmaceutical agent, the method 
comprising contacting the chemical with a receptor; and detennining whether the 
30 chemical interacts with the receptor to form a chemical-receptor complex; wherein the 
receptor comprises EGF-like repeats and cadherin-like repeats. 
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Preferably, the method includes contacting the chemical-receptor complex with a G- 
protein and determining whether the chemical-receptor complex stimulates the G- 
protein. 

5 Preferably the receptor resembles or is a GCR. 

In the following commentary, the term "receptor according to the present invention" 
includes the receptor as defined in the above-mentioned aspects of the present 
invention. 

10 

According to a fourth aspect of the present invention there is provided a method of 
affecting neural development comprising stimulating a receptor with a chemical; 
wherein the receptor is the receptor according to the present invention. This method 
can be an in vitro or an in vivo method. 

15 

According to a fifth aspect of the present invention there is provided the use of the 
receptor according to the present invention to screen chemicals for subsequent use as 
a pharmaceutical. 

20 According to a sixth aspect of the present invention there is provided a chemical that 
has been screened by the method of the present invention. 

According to a seventh aspect of the present invention there is provided a receptor 
capable of interacting with a G-protein and wherein the receptor comprises EGF-like 
25 repeats. 

According to a eighth aspect of the present invention there is provided a protein 
comprising the amino acid sequence represented as SEQ. LD. No. 1, or a fragment, 
homologue or variant thereof. 

30 
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According to a ninth aspect of the present invention there is provided a protein 
comprising the amino acid sequence represented as SEQ. LD. No. 3, or a fragment, 
homologue or variant thereof. 

5 According to a tenth aspect of the present invention there is provided a nucleotide 
sequence comprising the nucleotide sequence represented as SEQ. LD. No. 2, or a 
fragment, homologue or variant thereof, and/or wherein the nucleotide sequence is 
obtainable from one or more of NCIMB No. 40766, NCIMB No. 40767 and NC1MB 
No. 40768. 

10 

According to an eleventh aspect of the present invention there is provided a nucleotide 
sequence comprising the nucleotide sequence represented as SEQ. LD. No. 4, or a 
fragment, homologue or variant thereof, and/or wherein the nucleotide sequence is 
obtainable from one or more of NCIMB No. 40766, NCIMB No. 40767 and NCIMB 
15 No. 40768. 

According to an twelfth aspect of the present invention there is provided a vector 
capable of expressing the receptor according to the present invention or the protein 
according to the present invention, or comprising the nucleotide sequence according 
20 to the present invention. 

According to a thirteenth aspect of the present invention there is provided a construct 
comprising or capable of expressing any one of the vector according to the present 
invention, the receptor according to the present invention, the protein according to the 
25 present invention, or the nucleotide sequence according to the present invention. 

According to a fourteenth aspect of the present invention there is provided a cell, 
tissue or organ comprising or capable of expressing any one of the construct according 
to the present invention, the vector according to the present invention, the receptor 
30 acwrding to the present invention, the protein according to the present invention, or 
the nucleotide sequence according to the present invention. 
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According to a fifteenth aspect of the present invention there is provided an organism 
comprising or capable of expressing any one of the cell, tissue or organ according to 
the present invention, the construct according to the present invention, the vector 
according to the present invention, the receptor according to the present invention, the 
5 protein according to the present invention, or the nucleotide sequence according to the 
present invention. 

According to a sixteenth aspect of the present invention there is provided a receptor 
capable of interacting with a G-protein and wherein the receptor comprises EGF-like 
10 repeats, wherein the receptor is obtainable from neural tissue and/or ectodermal cells. 

According to a seventeenth aspect of the present invention there is provided a receptor 
capable of interacting with a G-protein and wherein the receptor comprises cadherin- 
like repeats, wherein the receptor is obtainable from neural tissue and/or ectodermal 
15 cells. 

According to an eighteenth aspect of the present invention there is provided a receptor 
capable of interacting with a G-protein and wherein the receptor comprises EGF-like 
repeats and cadherin-like repeats, wherein the receptor is obtainable from neural tissue 
20 and/or ectodermal cells. 

According to a nineteenth aspect of the present invention there is provided an assay 
kit comprising a surface having attached thereto or contained within or on any one of 
the organism according to the present invention, the cell, tissue or organ according to 
25 the present invention, the construct according to the present invention, the vector 
according to the present invention, the receptor according to the present invention, the 
protein according to the present invention, or the nucleotide sequence according to the 
present invention. 

30 Typically the assay kit will comprise a series of titre wells capable of holding a 
suitable sample of the present invention (e.g. the receptor or the gene coding for same 
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in cells or in a cell free environment). Preferably, the assay kit comprises a series of 
titre wells, wherein at least one of which well holds a suitable sample of the present 
invention (e.g. the receptor or the gene coding for same in cells or in a cell free 
environment). Optionally, the assay kit may comprise one or more G-proteins. 

5 

As mentioned above, if the assay kit of the present invention comprises the receptor 
of the present invention then that assay kit would be usefiil for screening chemicals 
that are capable of interacting with the receptor to form a chemical-receptor complex. 
With that assay kit, the interaction of the chemical-receptor complex with the G- 
10 protein can be observed either directly or indirectly. An example of indirect 
observation is observing a change (e.g. an increase) in cAMP levels. 

Alternatively, if the assay kit of the present invention comprises the nucleotide 
sequence of the present invention then that assay kit would be useful for screening 
15 chemicals for affecting expression of that sequence. 

Other aspects of the present invention include the use of the receptor of the present 
invention to screen for agents that are capable of any one or more of 

20 stimulating the receptor to cause neural cells to divide; 

stimulating the receptor to cause neural cells to differentiate; 

stimulating the receptor to affect cellular physiology; and 

25 

stimulating the receptor for use in the repair of trauma and/or in the treatment 
of neurodegenerative diseases. 
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Other aspects of the present invention include the use of the receptor of the present 
invention for one or more of: 

stimulating adenylate cyclase; 
5 increasing cAMP levels; and 

promoting neural growth. 

These uses can be in vitro or in vivo uses. 

1 0 Other aspects of the present invention include NCIMB No. 40766, NCIMB No. 40767, 
and NCIMB No. 40768. 

Further aspects of the present invention include: 

15 a pharmaceutical preparation consisting of or comprising the receptor of the 

present invention, optionally admixed with or contained within a suitable 
carrier, diluent or excipient 

a pharmaceutical preparation consisting of or comprising the nucleotide 
20 sequence of the present invention, optionally admixed with or contained within 

a suitable carrier, diluent or excipient. 

a pharmaceutical preparation consisting of or comprising a chemical when 
screened by the method of the present invention, optionally admixed with or 
25 contained within a suitable carrier, diluent or excipient 

the use of the receptor according to present invention in the manufacture of a 
medicament to treat neural disorder. 

30 the use of the nucleotide sequence according to present invention in the 

manufacture of a medicament to treat neural disorder. 
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the use of a chemical when screened by the method of the present invention 
in the manufacture of a medicament to treat neural disorder. 

A further aspect of the present invention includes a method of treating a subject in 
5 need of, or likely to be in need of, treatment for neural disorder wherein the method 
comprises administering to the subject a receptor according to the present invention, 
or a protein expressed by the nucleotide sequence according to the present invention, 
or a chemical when screened by the method of the present invention. 



1 0 An additional aspect of the present invention includes a hybrid receptor, and genes 
coding for the same and vectors etc. comprising same, wherein the hybrid receptor 
comprises at least a part of the receptor of the present invention and at least a part of 
another receptor, such as another receptor or even part or all of a G-protein. The 
hybrid receptor is advantageous as it allows one to affect and/or to tailor the 

15 stimulation of the receptor to one or more stimuli. 

Preferably the receptor is obtainable from neural tissue. 
Preferably the receptor is obtainable from ectodermal cells. 

20 

Preferably the receptor is prepared by use of recombinant DNA techniques. 

Preferably the receptor is obtainable from deposit NCIMB No. 40766 and/or deposit 
NCIMB No. 40767. 

25 

Alternatively, the receptor is obtainable from deposit NCIMB No. 40768. 



Preferably the receptor comprises the amino acid sequence represented as SEQ. LD. 
No. 1, or is a fragment, homologue or variant thereof. 



30 
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Preferably the receptor comprises the amino acid sequence represented as SEQ. I.D. 
No. 3, or is a fragment, homologue or variant thereof. 

Preferably the receptor is expressed by the nucleotide sequence represented as SEQ. 
5 I.D. No. 2. or is a fragment, homologue or variant thereof. 

Preferably the receptor is expressed by the nucleotide sequence represented as SEQ. 
I.D. No. 4, or is a fragment, homologue or variant thereof. 

10 Preferably the chemical is screened to determine if it is useful for one or more of: 

i. causing neural cells to divide; 

ii. causing neural cells to differentiate; 

iii. affecting cellular physiology; 

iv. repairing trauma; 

v. treating neurodegenerative diseases; 

vi. stimulating adenylate cyclase production; 

vii. increasing cAMP levels; 
viiu promoting neural growth. 

Preferred embodiments of the present invention therefore include: 

i. a receptor capable of interacting with a G-protein and comprising EGF-like 
repeats and/or cadherin-like repeats, wherein the receptor is obtainable from 

25 neural tissue and/or ectodermal cells; 

ii. a receptor comprising the sequence represented as SEQ. LD. No. 1, or a 
fragment, homologue or variant thereof capable of being stimulated by a 
chemical; 



15 



20 



30 
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iii. a receptor comprising the sequence represented as SEQ. I.D. No. 3, or a 
fragment, homologue or variant thereof capable of being stimulated by a 
chemical; 

5 iv. a receptor encoded by the nucleotide sequence represented as SEQ. LD. No. 

2, or a fragment, homologue or variant thereof and wherein the expressed 
protein is capable of being stimulated by a chemical; 

v. a receptor encoded by the nucleotide sequence represented as SEQ. LD. No. 
10 4, or a fragment, homologue or variant thereof and wherein the expressed 

protein is capable of being stimulated by a chemical; 



vi. a receptor obtainable from deposit NCIMB No. 40766; 
15 vii. a receptor obtainable from deposit NCIMB No. 40767; and 
viii. a receptor obtainable from deposit NCIMB No. 40768. 



With this aspect of the present invention, the receptor may comprise a plurality of any 
20 combination of the features I to viii. as listed above. 

In a highly preferred embodiment the receptor of the present invention is not expressed 
by the natural genomic DNA sequence when in its natural environment. Thus, highly 
preferred embodiments include the receptor when prepared by use of recombinant 
25 DNA techniques, including the expression of cDNA or a synthetic nucleotide 
sequence. 

Preferably the receptor is expressed by a cDNA sequence that is obtainable from any 
one or more of deposits NCIMB 40766, NCIMB 40767 and NCIMB 40768. 

30 
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In addition, or alternatively, preferably the receptor is expressed by a cDNA sequence 
that comprises sequence shown as SEQ. I.D. No. 2, or a sequence that is 
complementary thereto, or a sequence having substantial homology therewith, or a 
sequence containing any suitable codon substitutions but wherein the resultant protein 
5 is capable of acting as receptor as herein defined. 

In addition, or alternatively, more preferably the receptor is expressed by a cDNA 
sequence that comprises sequence shown as SEQ. I.D. No. 4, or a sequence that is 
complementary thereto, or a sequence having substantial homology therewith, or a 
1 0 sequence containing any suitable codon substitutions but wherein the resultant protein 
is capable of acting as receptor as herein defined. 

In a highly preferred embodiment the nucleotide sequence coding for the receptor of 
the present invention is not in its natural environment and under the control of the 
1 5 promoter with which it is naturally associated which is also in its natural environment 

Thus, highly preferred embodiments include the use of recombinant DNA techniques 
using for example cDNA or a synthetic nucleotide sequence. 

20 In a highly preferred embodiment the nucleotide sequence coding for the receptor of 
the present invention is not in its natural environment and under the control of the 
promoter with which it is naturally associated which is also in its natural environment, 
wherein the receptor comprises the amino acid sequence shown as SEQJ.D. No. 1, 
more preferably SEQ.LD. No. 3, or variant, fragment or homologue thereof, wherein 

25 the nucleotide sequence is a cDNA sequence that is obtainable from any one or more 
of deposits NCIMB 40766, NCIMB 40767 and NCIMB 40768, and wherein the 
nucleotide sequence comprises the sequence shown as SEQ. I.D. No. 2, more 
preferably SEQ. I.D. No. 4, or a sequence that is complementary thereto, or a 
sequence having substantial homology therewith, or a sequence containing any suitable 

30 codon substitutions but wherein the nucleotide sequence codes for a protein that is 
capable of behaving as a receptor as herein defined. 
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Other embodiments of the present invention include: a transformed host organism 
(such as a microorganism, such as K coll) capable of producing the receptor 
according to the present invention as a consequence of the introduction of a nucleotide 
sequence as herein described; a method for preparing the receptor according to the 
5 present invention comprising expressing a nucleotide sequence according to the present 
invention contained in the host organism and isolating the expressed receptor; and a 
vector (such as a transformed pBLUESCRIPT plasmid, /?G£^ plasmid or pCDNA3 
plasmid) incorporating the nucleotide sequence according to the present invention. By 
way of example, the receptor can be expressed in E. co//\ baculovirus, yeast or 
10 mammalian cell expression systems. 

AH of the above-mentioned aspects of the present invention optionally include the 
combination of the receptor of the present invention with a G-protein. 

1 5 The term "chemical" includes any chemical compound, including nucleotide sequences 
both in sense and anti sense orientation, proteins, enzymes etc. The term also includes, 
a ligand, wherein a ligand is a natural substance that naturally binds to the receptor. 

The term "pharmaceutical agent" includes chemicals for use as diagnostic and/or 
20 therapeutic purposes. The term also includes pharmaceutical agents for human and/or 
veterinary applications. 

The terms "screen" and "screening" include the use of the receptor according to the 
present invention to screen agents to assess their suitability for subsequent use as 
25 pharmaceutical agents, such as therapeutic agents and diagnostic agents. 

The term "receptor" is used in its normal sense as typically meaning a protein that 
spans the membrane of a cell and that can bind, on its extra-cellular side, a chemical 
(otherwise known as a ligand). Binding of the chemical causes changes to the receptor 
30 that result in a chemical (enzymatic) reaction being initiated on the intra-cellular part 
of the receptor. These changes are the first part of a signalling chain of actions that 
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result in some change to the cells physiology. In the case of GCRs, binding of the 
chemical causes the receptor to effect a G-protein with which it is associated on the 
inner membrane surface. This disturbance results in changes to the enzymatic state 
of the G-proteiru which then interacts with the signalling system. 

5 

The term M G-protein" is used in its normal sense as typically meaning a protein which 
is associated with a receptor and which is capable of being effected by the receptor. 
Changes in the receptor (binding of chemicals/ligands) effect the enzymatic state of 
the G-protein and these changes can effect the interactions of the G-protein with other 
10 proteins which are components of a cascade of signalling events. The outcome of the 
signalling is dependent on the nature of the cell containing receptor and G-protein. 
In a preferred embodiment, the receptor interacts with a G-protein. 

The term "EGF-like repeats" is used in its normal sense as typically meaning a protein 
15 sequence similar to the following "consensus" sequence. 

CX^CX^CX^ [gCXCX^C 

wherein C is cysteine and X is any amino acid. 

20 Preferably, the receptor of the present invention comprises at least one EGF-like repeat 
and/or at least one cadherin-like repeat. Typically, the receptor of the present 
invention has between 1 and/or 36 EGF-like repeats and between 1 to 36 cadherin-like 
repeats. Preferably, the receptor of the present invention has between 1 and 20 EGF- 
like repeats and/or between 1 and 20 cadherin-like repeats. Preferably, the receptor 

25 of the present invention has between 3 and 10 EGF-like repeats and/or between 3 and 
10 cadherin-like repeats. More preferably, the receptor of the present invention has 
between 4 and 7 EGF-like repeats and/or between 6 and 10 cadherin-like repeats. 
More preferably, the receptor of the present invention has at least about 6 EGF-like 
repeats and/or at least about 10 cadherin-like repeats. 

30 
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Preferably, the receptor of the present invention comprises at least one EGF-like repeat 
and at least one cadherin-like repeat. 

Preferably, the receptor of the present invention has between 1 and 36 EGF-like 
5 repeats and between 1 to 36 cadherin-like repeats. Preferably, the receptor of the 
present invention has between 1 and 20 EGF-like repeats and between 1 and 20 
cadherin-like repeats. Preferably, the receptor of the present invention has between 
3 and 10 EGF-like repeats and between 3 and 10 cadherin-like repeats. More 
preferably, the receptor of the present invention has between 4 and 7 EGF-like repeats 
10 and between 6 and 10 cadherin-like repeats. More preferably, the receptor of the 
present invention has at least about 6 EGF-like repeats and at least about 10 cadherin- 
like repeats. 

The term "chemical-receptor complex" includes binding of the chemical to the 
15 receptor, such as by hydrogen bonding and/or covalent bonding. The chemical- 
receptor complex may then interact with a G-protein. Determinauon of the formation 
of the chemical-receptor complex can be achieved by conventional techniques. 
However, it is preferred to determine formation of the chemical-receptor complex by 
observing the effect of the complex on a G-protein, such as by observing an increase 
20 in cAMP levels. 

The term "obtainable from" includes directly or indirectly obtaining the receptor. 
Examples of indirectly obtaining the receptor include expressing the receptor cDNA 
by means of a suitable expression system. 

25 

The terms "variant", "homologue" or "fragment" include any substitution of, variation 
of, modification of, replacement of, deletion of or addition of one or more amino 
acid(s)/nucleic acid from or to the sequence providing the resultant protein is capable 
of behaving as a receptor as herein defined. 
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The expression "substantial homology", which can be otherwise expressed as 
"substantial similarity", includes homology with respect to structure and/or nucleotide 
components, providing the resultant protein is a receptor as herein defined. 

5 With respect to sequence homology (i.e. similarity), preferably there is at least 50 % 
homology, preferably at least 60% homology, more preferably at least 75% homology, 
more preferably at least 80% homology, more preferably at least 85% homology, more 
preferably at least 90% homology, such as at least 95% homology. 

10 The term "complementary" means that the present invention also covers recombinant 
nucleotide sequences that can hybridise to the recombinant nucleotide sequences. 

The term "construct" - which is synonymous with terms such as "conjugate", "cassette" 
and "hybrid" - includes all or part of the nucleotide sequence according to the present 
15 invention directly or indirectly attached to another nucleotide sequence, such as a 
promoter. 

The construct may even contain or express a marker which allows for the selection of 
the genetic construct in the host into which it has been transferred. 

20 

The construct of the present invention preferably comprises a promoter. 

The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase 
binding site in the Jacob-Monod theory of gene expression. 

25 

The term "vector" includes an expression vector and a transformation vector. The 
term "expression vector" means a construct capable of in vivo or in vitro expression. 
The term "transformation vector" means a construct capable of being transferred from 
one species to another - such as from an E.Coli plasmid to another host 
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The terms "cell", "tissue" and "organ" include cell, tissue and organ per se and when 
within an organism. 

The term "organism" in relation to the present invention includes any organism that 
5 could comprise the recombinant nucleotide sequence coding for the receptor according 
to the present invention and/or products obtained therefrom, and/or wherein the 
recombinant nucleotide sequence according to the present invention can be expressed 
when present in the organism. 

10 Preferably the organism is a transgenic organism. The term "transgenic organism" in 
relation to the present invention includes any organism that comprises the recombinant 
nucieotide sequence coding for the receptor according to the present invention and/or 
products obtained therefrom, and/or wherein the recombinant nucleotide sequence 
according to the present invention can be expressed within the organism. Preferably 

15 the recombinant nucleotide sequence is incorporated in the genome of the organism. 

The term "protein" includes un- modified and modified proteins such as post- 
translationally modified proteins and glycosylated proteins. 

20 The receptor of the present invention is sometimes referred to as the ME2 protein. 

The following samples were deposited in accordance with the Budapest Treaty at the 
recognised depositary The National Collections of Industrial and Marine Bacteria 
Limited (NCIMB) at 23 St Machar Drive, Aberdeen, Scodand, AB2 1RY, United 
25 Kingdom, on 18 August 1995: 

E. coli XI- 1 blue containing mouse cDNA plasmid ME2(22) which was 

allocated deposit number NCIMB 40766; 

E. coli XI- 1 blue containing mouse cDNA plasmid ME2(78) which was 
allocated deposit number NCIMB 40767; 
30 E. coli DH1 containing human cosmid ME2HC20 which was allocated deposit 

number NCIMB 40768. 
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These deposits are discussed later in the Experimental Section (see Deposits). 

Thus, highly preferred embodiments of the present invention include any one of the 
aforementioned aspects of the present invention but wherein the receptor or the 
5 nucleotide sequence coding for same is obtainable from any one or more of deposits 
NCIMB 40766, NCIMB 40767 and NCIMB 40768. 

The present invention will now be described only by way of examples, in which 
reference shall be made to the following Figures, in which: 

10 



Figure 1 is a pictorial representation of a typical GCR; 



Figure 2 is a DNA map of the receptor of the present invention; 



15 



Figure 3 is a pictorial representation of the receptor of the present invention; 



Figure 4 is a schematic representation of expression patterns; 



Figure 5 is a restriction map; 



20 



Figure 6 presents SEQ ID No. 1; 



Figure 7 presents SEQ ID No. 2; 



25 



Figure 8 presents SEQ ID No. 3; 



Figure 9 presents SEQ ID No. 4; and 



30 



Figure 10 presents an analysis of an amino acid sequence (sequence range 1 
to 2707). 
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In some of the following commentary, sections (parts) of the gene coding for the 
receptor of the present invention are sometimes referred to as ME2(x) wherein "x" 
represents a particular numbered fraction. 

5 ME2 Genetic Mapping 

Our initial studies revealed that the receptor of the present invention is coded by a 
single copy gene. This single copy gene is conserved in organisms as diverged as 
human, mice and fruit flies. In particular, the single copy gene maps to human 
10 chromosome region 22^ and mouse chromosome 15. In both of these genomes the 
receptor gene is contained in a region associated with gastruiation and neural mutants 
and disorders. 

Expression Behaviour 

15 

To determine the in vivo expression of the receptor of the present invention, both 
reverse transcriptase polymerase chain reaction (RT-PCR) and wholemount in situ 
hybridisation were carried out on wild-type mouse embryos. 

20 RT-PCR analysis showed that the receptor of the present invention is first expressed 
in the early postimplantation embryo between 4 and 6 days post coitum (dpc), then 
continues until adulthood. 

The embryonic expression of the receptor of the present invention precedes the start 
25 of gastruiation, the event which results in the generation of the three germ layers of 
the developing embryo. Prior to this, the embryo does not contain neural tissue. 
Embryonic expression of the gene coding for the receptor of the present invention 
correlates with cells of ectodermal origin, which go on to form the nervous system. 



30 
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For ease of reference. Figure 4 is a schematic representation of the expression patterns 
of the receptor of the present invention in the developing central nervous system, in 
particular in the developing spinal cord (Figure 4(A)) and the developing hindbrain 
(Figure 4(B)). In this regard, interesting features of this dynamic expression pattern 
5 include the delineation of segments in the developing hindbrain and neural tube. In 
the hindbrain novel sub-rhombomeric expression was observed. In the neural tube, 
initially the transcripts are ubiquitous and then resolve into 5,4 and finally 2 
dorsoventrally restricted domains, one in the roof plate and one in the floor plate. 
Gene expression is highly localised and persists throughout development, with adult 
1 0 transcripts localised to the brain and eye. 

Expression Discussion 

Observing the pattern of expression of the receptor of the present invention indicates 
15 that it may be involved in the control of neural development. In this regard, the 
receptor is expressed almost exclusively in neural tissue (which is discussed in more 
detail later). In particular, expression precedes the first obvious neural structures in 
the developing embryo and the pattern of subsequent expression is complex. In later 
embryos, expression around the ventricle of the brain is significant since this is 
20 believed to be the region that contains the neural stem cells. 

Further observations revealed detection of the receptor of the present invention 
peri-ventricularly in adult brains. This particular pattern of synthesis is therefore under 
complex spatial and temporal controls and is the period in which the nervous system 
25 is proliferating most rapidly. 

Hence, the expression evidence strongly suggests that the receptor of the present 
invention might play an important part in the "control machinery" of neural 
development (see later discussion). 



30 
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Isolation Of The Receptor Of The Present Invention 

In some of the following commentary, sections (parts) of the gene coding for the 
receptor of the present invention are sometimes referred to as ME2(x) wherein "x" 
5 represents a particular numbered fraction. The isolation of the complete the receptor 
of the present invention coding sequence is shown in Figure 5. 

In more detail, in step 1 (Figure 5) a mouse 8.5 dpc whole embryo cDNA was 
screened using a human cDNA clone (16FB2). 16FB2 was originally isolated from a 
10 human fetal brain cDNA library by hybridisation with human cosmid ZnFP16 as 
described in Hoovers et al^ (Genomics 10 254-263). 

ME2(2) was then isolated from the initial screening of the mouse cDNA library 
(Figure 5). Extensive sequence analysis of both ME2(2) and 16FB2 has shown that 
15 they have sequence homology in a G-rich region in the 3* untranslated regioa 

Furthermore, complete nucleotide sequencing of ME2(2) showed that it had no 
homology to any sequences in the publicly accessible DNA databases. 

20 ME2(2) was then used immediately in whole mount in situ expression analyses and 
produced the striking expression pattern of the receptor of the present invention. The 
remainder of the gene sequence coding for the receptor of the present invention was 
then isolated as follows. 

25 In step 2 (Figure 5) ME2(2) was used to re-screen the mouse 8.5dpc cDNA library 
leading to the isolation of 6 different clones, the largest of which being ME2(22). 
Sequence analysis and database searches with the 3.2kb ME2(22) sequence identified 
a large open reading frame whose predicted amino acid sequence had homology to the 
family B group of G-protein coupled receptors. In this regard, ME2(22) covers the 

30 region from the polyA tail to trans-membrane region IV. 
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In step 3 (Figure 5) the 5' EcoRI to Pstl fragment of ME2(19) was used to rescreen 
which led to the isolation of 3 further cDNA clones. 

In step 4 (Figure 5) a primer PLKH20 corresponding to sequence 3527 to 3541 in the 
5 sequence shown as SEQ.I.D. No. 2 and derived from ME2(42) was used to isolate the 
HI fragment using the RACE method (Frohman, M.A. (1993) Methods in Enzymol. 
218 340-56). The DNA sequence of HI was used to identify the DNA from 3295 
to 3312 (again see SEQ. I.D. No. 2) which was used to make a new primer, PLKH26, 
for RACE analysis which gave rise R12 in step 5 (Figure 5). 

10 

In step 6 (Figure 5), R12 was used to screen the cDNA library which gave rise to, 
amongst other clones, the cDNA clone ME2(78) which extends almost to the 5*end. 

The minimal set of cDNA clones that defines all of the receptor of the present 
15 invention is ME2(78) and ME2(22) (see Figure 2 and Figure 5). 

DNA Analysis 

The DNA sequence of the receptor of the present invention was established entirely 
20 using published methods. 

In particular, the sequencing methodology used was the Sanger technique (Sanger et 
al, 1977 Proc. Natl. Acad. Sci. USA. 12 5463-7). The sequencing kits used were 
supplied by Pharmacia Biotech and the manufacturer's protocols were followed 
25 throughout, 

DNA was sequenced either by analysis of cloned molecules using sequencing primers 
specific for vector sequences and sequencing into the cDNA, or by synthesising 
specific primers, obtained from conventional commercial synthesis companies, and 
30 using these to establish DNA sequence directly from internal parts of the cloned cDNA 
molecule. 
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The sequence data obtained is shown in the attached Figures as SEQ. I.D. No. 2 (see 
Figure 7) and SEQ. I.D. No. 4 (see Figure 9). A map of the DNA sequence is 
represented in Figure 2. 



5 Re-isolation Of The Receptor Of The Present Invention 



The receptor of the present invention was re-isolated by using PCR techniques. 



Since the gene coding for the receptor of the present invention is over 4.5 kb it is 
10 preferable to isolate a cDNA containing the receptor coding regions by a multi-step 
process, rather than by a one step process using RT-PCR to isolate the whole cDNA. 
Hence, by using the complete coding sequence for the receptor of the present invention 
it is possible to isolate a series of cDNA fragments that can then be ligated. 

1 5 In this regard, primers flanking pairs of unique restriction enzyme sites were used to 
amplify individual regions and subsequent restriction enzyme digestion and ligation 
to generate the complete sequence. Details of this approach are given below and in 
Figure 2. 

20 Fragment 1 : Primer pair(3 ') PLKH3 1 +(5 y ) PLKH52 

Amplification products are digested with BspHl. 



25 



30 



Fragment 2: 



Primer 

PLKH59C 

PLKH47 

PLKHS2 

PLKH31 



Primer pair^') PLKH47+(5 , )PLKH59c 
Amplification products are digested with BspHl 



Sequence (5' -3'} Position 

5' CAG CGG GGA CTA CTG CGA GAC TGA AAT 1-27 

5* AGC TTG TCG AAG ATG TCA AC 2675-2694 

S' ATC TTA CAG CAT GAG AGC CGC C 2414-2435 

5' GGT AAT GAC ACA GTC ACT GGC ATG 4856-4879 
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Fragments 1 and 2 were independently amplified from mouse embryonic or adult brain 
reverse transcribed cDNA under standard PCR conditions- The amplification products 
were then subsequently directly restriction endonuclease digested with BspHI to give 
ragged ends. Fragments I and 2 were then ligated to each other and then cloned into 
5 a T vector. 

Construction Of A Complete cDNA Clone 

Construction of a complete cDNA clone for the gene for the receptor of the present 
1 0 invention was as follows. 

In particular, construction of a complete cDNA clone for the gene for the receptor of 
the present invention was achieved using the protocol detailed in the previous section. 

15 In more detail, the method used relied on the minimal set of cDNA clones obtained 
from the mouse 8.5dpc libraries mentioned above (see also Figure 2 and Figure 5). 

The minimal set of clones ME2(22) and ME2(78) have EcoRl linkers and are cloned 
into pBluescript plasmid vector. Since there is no EcoRl site in the ME2 transcript, 
20 construction of a complete cDNA clone will be done by EcoRl + Avrll digestion of 
ME2(22), isolation of the 2.6kb fragment and ligation of this to the 4.3kb EcoRl plus 
Avrll fragment from ME2(78): this product is cloned into the appropriate EcoRl 
digested vector. 

25 Deposits 

As mentioned above three deposits have been made in accordance with the Budapest 
Treaty. In this regard, NCIMB 40766 is an E. coli XI- 1 blue host with a pBluescript 
SK+ vector containing fragment ME2(22) - i.e. nucleotides 3657 to 6794 (see 
30 SEQ.LD.No. 2). 
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NCIMB 40767 is an E. coli XI- 1 blue host with a pBluescript SK+ vector containing 
fragment ME2(78) - i.e. nucleotides 1 to 4813 (see SEQ.I.D. No. 2): 

NCIMB 40768 is a recombinant cosmid containing the main part of the human 
5 receptor gene according to the present invention. In this regard, the cosmid vector is 
pCos2EMBL and the host cell £ coli DHL The human DNA derives from a region 
of human chromosome 22^ and contains parts of the human receptor of the present 
invention gene including parts- of the 7TM region but not extending further 5* than 
this. 

10 

In order to prepare a full length cDNA clone from NCIMB 40766 and NCIMB 40767, 
the appropriate cDNA fractions can be excised by use of suitable restriction enzymes, 
isolated and then ligated. The full length cDNA can then be inserted into any suitable 
expression system and subsequently expressed by suitable means. The pBluescript 
15 SK+ plasmids can be recovered from bacterial cells grown in L-broth containing 100 
(Hg/ml Ampicillin using routine methods detailed in Sambrook et aL, (1989 Molecular 
cloning - A laboratory manual. Second edition. Cold Spring harbor laboratory press.). 
Then the receptor cDNA fragments may be isolated subsequent to their excision with 
£co*I. 

20 

Likewise, the DNA from NCIMB 40768 and/or fragments thereof can be excised by 
use of suitable restriction enzymes, isolated, inserted into any suitable expression 
system and subsequently expressed by suitable means. The DNA can be recovered by 
growing the bacteria in L-broth supplemented with 30 (ug/ml Kanamycin and 
25 recovering the DNA according to routine methods detailed in Sambrook et ai y (1989 
Molecular cloning - A laboratory manual. Second edition. Cold Spring harbor 
laboratory press). The human DNA fragments can be resolved by cleavage with 
almost any 6-base recognition enzyme. 



30 
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Amino acid analysis 

The amino acid sequence data are listed in the attached sequence listings as SEQ. LD. 
No. 1 (see Figure 6) and SEQ. LD. No. 3 (see Figure 8). 

5 

Amino acid analysis of the receptor of the present invention reveals that it appears to 
be a large membrane spanning receptor having an unusual structure. This structure 
is pictorially shown in Figure 3. 

10 In slightly more detail, the C-terrninal region appears to have the structure of a 7-pass 
transmembrane receptor related to the B family of G-protein coupled receptors 
(GCRs). Thus it is believed that the receptor of the present invention is a new protein. 

Further amino acid analysis reveals that the receptor of the present invention contains 
1 5 EGF-like repeats. In this regard, towards the N-termmus (extracellular) of the receptor 
there are a number of EGF-like repeats (see Figure 3). One of these EGF-like repeats 
is -300 amino acids away from the C-tenninal sequence. The EGF-like repeats are 
shown in Figure 10 (marked "EGF 1" etc.). Divergent EGF-like repeats are also 
marked. 

20 

A large number of molecules containing EGF-like repeats have been identified in both 
vertebrates and invertebrates. These molecules include, for example, blood clotting 
factors and proteins that are required for correct embryonic development. 

25 Examples of proteins that are required for correct embryonic development, which 
molecules have primarily been characterised in invertebrates, include fibropellin, a cell 
coat protein of sea urchins, glp-1 and lag- 12 proteins required for inductive 
interactions in the nematode worm C elegans, and a number of Drosophila proteins 
including Crumbs, which is required for establishing epithelial cell polarity. 

30 Additional examples include Notch, Delta and Serrate proteins, which are required for 
neurogenesis, and Slit which is a protein involved in axonal pathfinding. Notch and 
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its ligands Delta and Serrate are involved in cell-cell signalling that determines 
adjacent cell fates; though this signal is not directly mitogenic. 

To date, there have been reports of some isolated vertebrate proteins that have some 
5 homology (similarity) to the invertebrate proteins. For example, three Notch genes, 
a single Delta, Jagged and Delta-like have been identified to date. 

Other vertebrate proteins that have been isolated are EMR-1 (Baud et al 1995 
Genomics 26 334-344) and CD97 (Genebank accession no. XS4700). However, there 
10 is no mention of possible utility of such proteins, let alone mention of pattern of 
expression/synthesis. 

Further amino acid analysis reveals that the receptor of the present invention contains 
cadherin-Iike repeats. These cadherin-like repeats are shown in Figure 10 (marked as 
15 "CD 1" etc.). Cadherin-Iike repeats have been implicated in protein-protein 
interactions (Geiger and Ayalon 1992 Ann Rev Cell Biol 8 307-332). 

Some of the transmembrane portions of the receptor of the present invention are 
shown in Figure 10 (marked as "TM 1" etc.). 

20 

When the amino acid sequence of the receptor of the present invention is compared 
with the amino acid sequences of known proteins that are required for correct 
embryonic development it is observed that there is some sequence homology, though 
this is less than 80%. More importantly, however, in distinction to the receptor of the 
25 present invention those proteins are all single-pass transmembrane proteins with the 
cluster of EGF-like repeats in their extracellular domains. In contrast, the receptor of 
the present invention has a seven-pass transmembrane topology, similar to a GCR. 

Accordingly, as there have been no reports in the literature for a receptor found in 
30 neural tissue that is capable of interacting with a G-protein but, in addition, having 
EGF-like repeats so the receptor of the present invention is novel. 
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Functions Of The Receptor Of The Present Invention 

The EGF-Iike repeats and the 7 pass transmembrane (7TM) structures of the receptor 
of the present invention suggest important functions and utilities for the receptor. In 
5 this regard, the EGF-like repeats will bind a ligand which may, as in the case of 
Notch/Delta, be a protein attached to another cell, or it may be free, as in the case of 
blood clotting factors. In either case, binding of a ligand will cause the cytoplasmic 
region to signal to the cellular machinery, via a G-protein. Since the 7TM region is 
most similar to the family B GCRs regions, it is likely that it will signal via a 
10 stimulatory G-protein which stimulates adenylate cyclase and causes cAMP levels to 
increase as all family B receptors operate in this fashion. 

Likewise, the cadherin-like repeats of the receptor of the present invention suggest 
important functions and utilities for the receptor. 

15 

Without wishing to be bound by theory, it is believed that the outcome of this 
signalling, based upon known examples of GCRs, could be due to one or more of the 
following effects: 

20 1. Stimulation of the receptor of the present invention might cause neural cells 
to divide. 

2. Stimulation of the receptor of the present invention might cause neural cells 
to differentiate. 

25 

3. Stimulation of the receptor of the present invention might cause changes to 
cellular physiology. 
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Uses 

Based on the above-mentioned functions of the receptor of the present invention, it is 
clear that the receptor can be used in a number of useful utilities. Some of those 
5 utilities are now presented. 

1. The receptor of the present invention could be a therapeutic target. In this regard, 
if the ligand or a modified ligand can be defined, then artificial treatment of neural 
tissue with this ligand could trigger the receptor of the present invention to signal. 

10 This signal would then trigger the normal response of the receptor of the present 
invention, which would be a way of modulating the growth, function or properties of 
brain cells that express the receptor of the present invention. This would therefore 
have an application in the repair of trauma and in treatment of neurodegenerative 
diseases, such as Parkinson's disease and Alzheimer's disease. 

15 

2. The receptor of the present invention can be used as a test reagent to identify the 
ligand. This can be done for example in artificial test systems where the receptor of 
the present invention is expressed from a recombinant expression vector in cells that 
otherwise do not express the receptor of the present invention- These cells can then 

20 be used to test for binding of the ligand by studying cAMP level changes upon 
treatment of cells with proteins or other cells. 

3. The nucleotide sequence coding for the receptor of the present invention gene could 
be used to modify the behaviour of cells by transgenesis. Potential areas of 

25 application include the modification of cells used for transplantation treatments of 
degenerative diseases and the modification of whole animals by normal transgenic 
procedures. Both of these applications would result in cells with modified growth 
potential. 
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Screening Protocol 

Two screening protocols are now presented. 

5 The first is based on Lutz et al. (1993. FEBS Letters 334, 3-8). In outline, a ME2 
expression vector (for example pcDNA-1) is constructed and then introduced into Cos- 
7 cells by transfection. This results in the cells expressing ME2 and, because of the 
biological properties of G-proteins, a G-protein becomes naturally associated with 
ME2 on the cell surface. The cells are then treated in vitro with proteins, chemicals, 
10 other cells (intact or broken up). Then one assays for changes to cAMP levels which 
are caused by ME2 binding a ligand and signalling via the G-protein to alter cAMP 
levels. If changes are seen, this implies that the ligand is or is contained in, in the 
substance that was treated with cells. This is the assay of choice for purification of 
the ligand. 

15 

The second method is based upon Cheng & Flanagan (1994, Cell 79 157-168). In this 
regard, one synthesises in E. coli and isolates the N-terminal fragment of ME2 (N 
terminus of ME2 to the membrane entry point in fig 3) which has been fused at this 
point to the enzyme alkaline phosphatase. This hybrid protein binds to its normal 
20 ligand. Binding is then detected by looking for the alkaline phosphatase dragged along 
at its end. This could be used to isolate cDNA clones containing the normal ME2 
ligand using exactly the methods detailed in Cheng & Flanagan (1994, Cell 79 157- 
168). 

25 The present invention therefore relates to a novel receptor and a novel use of that 
receptor. 

The nature of the receptor of the present invention, its spatiotemporally restricted 
expression, coupled with the evolutionary conservation of the receptor gene suggests 
30 that the receptor of the present invention plays a role in the detennination of 
ectodermal cell types within the developing embryo. This has important consequences 
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as it enables possible pharmaceutical agents to be screened to see if they stimulate the 
receptor and if so then those agents could be used to promote neural growth. In 
addition, the receptor can be inserted (such as by way of transplantation or by way of 
transgenesis of the coding gene) into a subject either in need of treatment or to 
5 develop in vivo screening techniques. 

Other modifications of the present invention will be apparent to those skilled in the 
art without departing from the scope of the invention. 
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CLAIMS 

1. A method of screening a chemical for subsequent use as a pharmaceutical 
agent, the method comprising contacting the chemical with a receptor; and detennining 

5 whether the chemical interacts with the receptor to form a chemical-receptor complex; 
wherein the receptor comprises EGF-like repeats. 

2. A method of screening a chemical for subsequent use as a pharmaceutical 
agent, the method comprising contacting the chemical with a receptor: and determining 

1 0 whether the chemical interacts with the receptor to form a chemical-receptor complex; 
wherein the receptor comprises cadherin-like repeats. 

3. A method of screening a chemical for subsequent use as a pharmaceutical 
agent, the method comprising contacting the chemical with a receptor; and detenmning 

1 5 whether the chemical interacts with the receptor to form a chemical-receptor complex; 
wherein the receptor comprises EGF-like repeats and cadherin-like repeats. 

4. A method according to any one of claims 1 to 3 wherein the receptor is 
obtainable from neural tissue. 

20 

5. A method according to any one of claims 1 to 4 wherein the receptor is 
obtainable from ectodermal cells. 

6. A method according to any one of the preceding claims wherein the receptor 
25 is prepared by use of recombinant DNA techniques. 

7. A method according to any one of the preceding claims wherein the receptor 
is obtainable from deposit NCIMB No. 40766 and/or deposit NCIMB No. 40767. 



30 



8. A method according to any one of claims 1 to 6 wherein the receptor is 
obtainable from deposit NCIMB No. 40768. 
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9. A method according to any one of the preceding claims wherein the receptor 
comprises the amino acid sequence represented as SEQ, I.D. No. 1, or is a fragment, 
homologue or variant thereof. 

5 10. A method according to any one of the preceding claims wherein the receptor 
comprises the amino acid sequence represented as SEQ. I.D. No. 3, or is a fragment, 
homologue or variant thereof. 

11. A method according to any one of the preceding claims wherein the receptor 
10 is expressed by the nucleotide sequence represented as SEQ. I.D. No. 2, or is a 

fragment homologue or variant thereof. 

1 2. A method according to any one of the preceding claims wherein the receptor 
is expressed by the nucleotide sequence represented as SEQ. I.D. No. 4, or is a 

15 fragment, homologue or variant thereof. 

13. A method according to any one of the preceding claims wherein the method 
includes contacting the chemical-receptor complex with a G-protein and determining 
whether the chemical-receptor complex stimulates the G-protein. 

20 

14. A method according to any one of claims 1 to 13 wherein the chemical is 
screened to determine if it is useful for one or more of: 





i. 


causing neural cells to divide; 


25 


ii. 


causing neural cells to differentiate; 




iii. 


affecting cellular physiology; 




iv. 


repairing trauma; 




v. 


treating neurodegenerative diseases; 




vi. 


stimulating adenylate cyclase production; 


30 


vii. 


increasing cAMP levels; 




viii. 


promoting neural growth. 
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15. A method of affecting neural development comprising stimulating a receptor 
with a chemical; wherein the receptor is the receptor as defined in any one of claims 
1 to 14. 

5 16. Use of a receptor as defined in any one of claims 1 to 14 to screen for agents 
that are capable of stimulating the receptor to cause neural cells to divide. 

17. Use of a receptor as defined in any one of claims 1 to 14 to screen for agents 
that are capable of stimulating the receptor to cause neural cells to differentiate. 

10 

18. Use of a receptor as defined in any one of claims 1 to 14 to screen for agents 
that are capable of stimulating the receptor to affect cellular physiology. 

19. Use of a receptor as defined in any one of claims 1 to 14 to screen for agents 
15 that are capable of stimulating the receptor for use in the repair of trauma and/or in 

the treatment of neurodegenerative diseases. 

20. Use of a receptor as defined in any one of claims 1 to 14 to stimulate adenylate 
cyclase. 

20 

21. Use of a receptor as defined in any one of claims 1 to 14 to increase cAMP 
levels. 

22. Use of a receptor as defined in any one of claims 1 to 14 to promote neural 
25 growth. 

23. Use of a receptor as defined in any one of claims 1 to 14 to screen chemicals 
for subsequent use as a pharmaceutical. 

30 24. A chemical when screened by the method according to any one of claims 1 to 
15. 
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25. A receptor capable of interacting with a G-protein and wherein the receptor 
comprises EGF-Hke repeats, wherein the receptor is obtainable from neural tissue 
and/or ectodermal cells. 

5 26. A receptor capable of interacting with a G-protein and wherein the receptor 
comprises cadherin-like repeats, wherein the receptor is obtainable from neural tissue 
and/or ectodermal cells. 

27. A receptor capable of interacting with a G-protein and wherein the receptor 
10 comprises EGF-like repeats and cadherin-like repeats, wherein the receptor is 

obtainable from neural tissue and/or ectodermal cells. 

28. A receptor according to any one of claims 25 to 27 wherein the receptor is the 
receptor as defined in any one of claims 4 to 14. 

15 

29. A protein comprising the amino acid sequence represented as SEQ. I.D. No. 
1, or a fragment, homologue or variant thereof. 

30. A protein comprising the amino acid sequence represented as SEQ. I.D. No. 
20 3, or a fragment homologue or variant thereof. 

31. A nucleotide sequence comprising the nucleotide sequence represented as SEQ. 
I.D. No. 2, or a fragment, homologue or variant thereof, and/or wherein the nucleotide 
sequence is obtainable from one or more of NCIMB No. 40766, NCIMB No. 40767 

25 and NCIMB No. 40768. 

32. A nucleotide sequence comprising the nucleotide sequence represented as SEQ. 
I.D. No. 4, or a fragment, homologue or variant thereof, and/or wherein the nucleotide 
sequence is obtainable from one or more of NCIMB No. 40766, NCIMB No. 40767 

30 and NCIMB No. 40768. 
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33. A vector capable of expressing the receptor as defined in any one of claims 1 
to 14 or the protein of claim 29 or 30, or comprising the nucleotide sequence as 
defined in claim 3 1 or 32. 

34. A construct comprising or capable of expressing any one of the vector of claim 
33, the receptor as defined in any one of claims 1 to 14 or the protein of claim 29 or 
30, or comprising the nucleotide sequence as defined in claim 31 or 32. 

35. A cell, tissue or organ comprising or capable of expressing any one of the 
construct of claim 34, the vector of claim 33, the receptor as defined in any one of 
claims 1 to 14 or the protein of claim 29 or 30, or comprising the nucleotide sequence 
as defined in claim 3 1 or 32. 



36. An organism comprising or capable of expressing any one of the cell, tissue 
15 or organ of claim 35, the construct of claim 34, the vector of claim 33, the receptor 
as defined in any one of claims 1 to 14 or the protein of claim 29 or 30, or 
comprising the nucleotide sequence as defined in claim 31 or 32. 



10 



20 



37. An assay kit comprising a surface having attached thereto or contained within 
or on any one of an organism according to claim 36, the cell, tissue or organ of claim 
35, the construct of claim 34, the vector of claim 33, the receptor as defined in any 
one of claims 1 to 14 or the protein of claim 29 or 30, or comprising the nucleotide 
sequence as defined in claim 31 or 32. 



25 38. NCIMB No. 40766. 
39. NCIMB No. 40767. 



40. NCIMB No. 40768. 
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41. A method substantially as described herein. 

42. A use substantially as described herein. 

5 43. A receptor substantially as described herein. 

44. An amino acid sequence substantially as described herein. 

45. A nucleotide sequence substantially as described herein. 
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FIG. U 
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AMINO ACID SEQUENCE OF ME2 REGION COVERED BY ME2C78) AND ME2(22) 
1573 RESIDUES 



GDYCETE I DLC YSNPCGANGGCRSREGGYTCECFEDFTGEHCQVNVRSGRCASGVCK 
NGGTCVNLLIGGFHCVCPPGEYEHPYCEVSTRSFPPQSFVTFRGLRQRFHFTVSLAF 
ATQDRNALLLYNGRFNEICHDFIALEIVEEQLQLTFSAGETTTTVTPQVPGGVSDGRW 
HSVLVQYYNKPNIGHLGLPHGPSGEKVAVVTVDDCDAAVAVHFGSYVGNYSCAAQGT 
QSGSKKSLDLTGPLLLGGVPNLPEDFPVHSRQFVGCMRNLSIDGRIVDMAAFIANNG 
TRAGCASQRNFCDGTSCQNGGTCVNRWNTYLCECPLRFGGKNCEOAMPHPQRFTGES 
VVLWSDLDITISVPWYLGLMFRTRKEDGVLMEATAGTSSRLHLQILNSYIRFEVSYG 
PSDVASMQLSKSRITDGGWHHLLIELRSAKEGKDIKYLAVMTLDYGMDQSTVQIGNQ 
LPGLKMRTIVIGGVTEDKVSVRHGFRGCMQGVRMGESSTNIATLNMNDALKVRVKDG 
CDVEDPCASSPCPPHRPCRDTWDSYSCICDRGYLEKKCVDACLLNPCKHVGSLCALP 
NTPRGYSCECGPGHYGQYCENKVDLPCPKGWWGNRCVAPV7VLSAKAL I PTATRPMA 
SARRITTSPQPRIVAFPVTVSPRSHSRACDMDTGQCACKPGVIGRQCNRCDNPFAEV 
TSLGCEV I YNGCPRAFEAGI WWPQMKFGQPAAVLCPKGSVGNAVRHCSGEKGWLPPE 
LFNCTSGSFVDLKALNEKLNRNETRMDGNRSLRLAKALRNATQGNSTLFGNDVRTAY 
QLLARI LQHESRQQGFDLAATREANFHEDWHTGSALLAPATEASWEQI QRSEAGAA 
QLLRHFEAYFSNVARNVKRTYLRPFVI VTANMI LAVDI FDKLNFTGAQVPRFEDIQE 
ELPRELESSVSFPADTFKPPEKKEGPVVRLTNRRTTPLTAQPEPRAERETSSSRRRR 
HPDEPGQFAVALVVIYRTLGQLLPEHYDPDHRSLRLPNRPVINTPWSAMVYSEGTP 
LPSSLQRPILVEFSLLETEERSKPVCVFWNHSLDTGGTGGWSAKGCELLSRNRTHVT 
COCSHSASCAVLMDISRREHGEVLPLKIITYAALSLSLVALLVAFVLLSLVRTLRSN 
LHS I PQEP I HALFFSQL I FMVG I NQTENPFLCTVVAI LLHYVSMGTFAWTt VENLHV 
YRMLTEVRN IDTGPMAFYH VVGWG I PAI VTGLAVGLDPQGYGNPDFCWLSLQDTL I W 
SFAGPVGTVI IINTVIFVLSAKVSCQRKHHYYERKGWSMLRTAFLLLLLVTATWLL 
GLLA VNSDTLSFHYLFAAFSCLQG I FVILF YC VANREVRKHLRAVLAGKKIQLDDSA 
TTRATLLTRSLNCNNTYSEGSRHAPHRPGQSTASLDSTTRDEGVQKLSVSSGPARGN 
HGEPDASF I PRNSKKAHGPDSDSDSELSLDEHSSSYASSHTSDSEODGGEAEDKWNP 
AGGPAHSTPKADALANHVPAGWPDESLAGSDSEELDTEPHLKVRPRSAWSYTGRRRA 
ITVATGPLTOVGSWPSQWPCLAASPRSSGKAS* 
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COMPLETE NUCLEIC ACID SEQUENCE OF ME2 REGION COVERED BY ME2C78) AND ME2C22) cDNA 
CLONES 

NUCLEOTIDES 1 - 6791 



CCGGG6ACTACTGCGAGACT6AAATTGATCTTTGCTACTCCAATCCGTGCGGGGCCA 
ATGGCGGCTGCXffiAGCCGTGAGGGTGGCrACACTTGTGAGTGCITCGAGGACTTCA 
CTGGGGAGWTTGCCAGGTGMCGnCGCTCAGGCCGCTGTGCCAGCGGAGTATGCA 
AAAACGGGGGCACCTGCGTGAACCTGCTCATTGGAGGCTTCCACTGTGTGTGCCCGC 
CCGGCGAGTATGAG^TCCCTACTGTGAAGTGAGCACOVGGAGCTTCCWCCCWiGT 
COTCOTACCnCCGA(^TGCG(£^^ 

nGCCACCCAGGACAGGMTGCCCTGCTGCTCTACMTGGCCGCTTCAATGAGAAGC 
ACGACTTC^TCGCCCTGGAGATTGTGGAGGAGCAGCTGCAGCTCACGTTCTCGGCAG 
GTGAGACCACAACCACGGTGACACCGCAGGTTCCTGGAGGTGTGAGCGATGGGCGGT 
GGCATTCGGTGCTGGTGCAGTACTACAACAAGCCCAACATTGGCCACCTGGGCCTGC 
CCCACGGGCCGTCTGGAGAGAAGGTGGCTGTGGTGACTGTGGATGACTGTGACGCAG 
CGGTGGCCGTGCACTTTGGAAGTTACGTGGGGAACTACAGCTGCGCTGCCCAGGGCA 
CTCAGAGCGGCTCCAA6AAGTCACTGGATCTGACTGGTCCTCTGCTTCTGGGTGGTG 
TCCCCMCCTGCCAGAAGACTTCCCCGTGOOGCCGTt^TTTGTGGGATGCATGC 
GAAACCTGTCCATCGATGGCCGGATTGTGGAWTGGCTGCGTTTATTGCCAACAATG 
GTACCAGGGCAGGCTGTGCTTCTCAGAGGMCTTCTGC^TGGGACCTCATGCCAGA 
ACQQGGGCACCT^GTGAAOVGGTGGAACACGTACnATGTGAGTGCCCGCTCOaCT 
TTGGTGGAAAGAACTGTGAACAAGCTATGCCACA(XCTCAGCGC1TCACTGGTGAGA 
GCGTCGTGTTGTGGAGTGACCTrGACATCACWTCTCTGTGCCTTGGTACCTGGGGC 
TWTCTTCCGGACCCGGAAGGAGGATGGTGTGCTGATGGAAGCCACAGCTGGCACGT 
(XTCOGGCTCCXrCTCf^T^ 

GCCCCTCTGACGTGGCATCCATGCAGCTGTCCAAGTCCCGGATAACTGACGGGGGGT 
GGCATCACCTGCTCATAGAACTGAGGAGTGCCAAGGAGGGCAAGGACATCAAATACC 
TGGCAGTCATGACCnGGACTATGGGATQGACCAGAGCACAGTGCAGATTGGGAATC 
AGCTTCCTGGGTTGAAGATGCGGACTATTGTCATCGGAGGTGTGACCGAGGACAAGG 
TCTCTGTCCGCWTGGTTTC(^AGGCTGTATGCAGGGAGTGAGGATGGGAGAGAGCT 
CWCCAAWTTGCCACCCTGAACATGAATGACGCCCTCAAGGTCAGGGTGAAGGACG 
GCTGTGATGTffiAGGACCCATGTGCCTCAAGCCCCTGCCCTCCCCATAGACCCTGCC 
GTGACACATGGGACAGCTACTCCTGCATCTGTGACAGAGGGTACTTGGAAAAAAAGT 
GTGTGGATGCGTGTCTCCTGAACCCCTGCW(>CGTTGGCAGCCTGTGTGCGCTCC 
CCAACACTCCTCGAGGCTACTCCTGCGAGTGCGGACCCGGCCACTATGGGCAGTACT 
GTGAGAACAAAGTCGACCTTCCGTGCCCCAAAGGCTGGTGGGGGAACCGGTGTGTGG 
CCCCTGTCACTGTGCFGTCAGC£AAGGCT^ 

CCAGTGCAAGGAGAATTACT ACAAGCCCCCAGCCCAGGATCGTTGCCTTCCCTGT6A 

CTGTTTCCCCCCGCTCCWCAGCCGTGCCTGCGAWTGGACACTGGGCAGTGTGCCT 

GCAAGCCTGGTGTCATCGGCCGTCAGTGCMCCGCTGTGATAATCCTTTCGCGGAGG 

TWCTCGCTCGGCTGTGAAGTGATCTACMTGGGTGTCCCAGAGWTTTGAGGCTG 

(XATCTGGTGGCCACAGATGAMmGGGCAGCCAGCAGraTGCTATGCCCCAAA^ 

GATCCGTGGGTMCGCAGTCCGGCACTGCAGTGGGGA6AAGGGCTGGCTTCCCCCAG 

AGCTCnCMCTGCACCTCTGGCTCCTTTGTGGACCTWGGCCTTGAACGAGAAAC 

TGMCCGCMCGAGACAAGAATGGACGGGAACCGGTCCCTGCGGCTGGCAAAGGCTC 

TGAGGAACGCDCGCAGGGGAAt^GCACCCTCTiTGGWATGATGTGCGCACGGCCT 

ACCAGCTTCTGGCCCGCATCTTACAGCATGAGAGCCGCC^GCAGGGCTnGACCTGG 

CAGCCACCCGAGAGGCTMTTTTCATGAGGATGTCGTCCATACAGGCAGCGCCCTCC 

TGGCCCCAGCTACAGAGGCATCGTGGGAACAGATCCAGCGAAGCGAffiCTGGTG(^ 

(XCAGCTACTGAGGWTnCGAGGCATACTT(^CM{^TGGC^GAMTGTGAA^ 

G6ACCTATCTGAGGCCCHCGTttTCGTCACCGC(^(ATGATrCTTG(^TTGAW 

TCTTCGACAAGCTGMCTTWCH^GCCCAGGTG^ 

AAGAGCTCCCAAGGGAGCTGGAGTCCTCCGTGTCCTTCCCAGCTGACACCTTCAAGC 
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CACCAfiAGAAAAAAGAAGGCCCCGTSGTGAGGCTGACCAACCGGAGGACTACCCCAC 
TCACGG(>CAACCAfiAGCCCAQGGCT6AGAGGGAMCCTCATCCAGCA6ACGGAQ6A 
GAWCCCCGATGAGCCTGGACAGTTTGCTGTTGCCCTGGTrGTCA'TTTACCGGACCC 
TGGGTCAGCTGCTGCCT6AACACTAT6ACCCCGACCATCGCAGCCTCCGACTGCCTA 
ACCGGCCTGTCATCAACACCCCC6TGGTGAGTGCTATGGTGTACAGTGAQGGAACCC 
CACTCCCCAGCTCTCTGCAGAGGCCTATCCT^^ 

AGGAACGAAGCAAACCTGTCTGTGTATTCTGGAACCACTCCCTCGACACTGGTGGGA 

CTGGAGGGTGGTCAGCCAAGGGCTGTCAACTTCTGTCGAGGAACCGGAC^ 

CnGCCAGTGCAGCWTTCGGCWGCTGCGCGGTGCTCATGGACATTTCCAGACGTG 

AGCACGGGGAffiTTCTGCCCCTGAAGATCATCACCT ATGCCGCCCTGTCCTTGTCTT 

TGGTGGCCCTCCTGGTGGCCrTTGTCCnCTCTCGCTCGTTCGGACACTGCGCTCCA 

ACCTGCACAGCATCCCACAAGAACCT ATCCACGCTCTGTTCTTCTCCCAGCTCATCT 

TCATGGTCGGCATCMCWGACTGAGAACCCG1TTCTCTGCACAGTGGTCGCCATCC 

TCCTGCACTACGTCTCCATGGGWCCnCGCCT^GACCCTTGTGGAGAACTTGCATG 

TCTACCGCATGCTGACAGAAGTGCGOWCATCGACACTGGGCCCATGGCGTTCTACC 

ACGTGGTGGGCTGGGGCATCCCTGCCAnGTCACAGGACTGGCTGTTGGCTTGGACC 

CTCAGGGCTATGGAMCCCTGACTrCTGCTGGCTGTCCCTrCAGGATACCCTGATn 

GGAGCrnGCTGGGCCTGTCGGMCGGTTATMTCATCAACACAGTCATCTTTGTCC 

TGTCTGCAAAGGTTTCCTGCCAAAGAAAGCACCATTAnA^ 

TCTCCATGCTGAGGACGGCCTTCCTCCTGCTGCTGCTCGTCACTGCCACCTGGCTGC 

TGGGACTGCT(£CGGTCAACAGTGACACT CTTAGCTTTCACTACCTCTTTGCTGCCT 

TCAGCTGCTTGCAGGGCATCTnGTCCTCCTGnCTACTGCGTGGCCAACAGGGAGG 

TGCGGAAGCACCTGAGGGCGGTGCTGGCAGGGAAGAAGCTGCAGCTGGATGACTCGG 

CCACCACTCGGGCCACTCTGCTMCGCGCTCCCTCMCTGCMCAACACCTACAGCG 

AA^TCCAGACATGCTCCGCACCGCCCTGGGCAGTCCACA^CTCTCTQGACAGTA 

CWCCAGGGATGAAGGGGTCCAGAMCTCAGTGTGTCCTCTGGCCCAGCCCGTGGTA 

ACCATGGAGAACCAfiATGCATCCnCATCCCTAGGMCTCCAAAAAAGCTCACGG^ 

CTGACTCTGACTCTGACAGTGAGCTGTCCCTGGACGA^CACAGTAGTTCCTACGCCT 

CTT(^CACACATCGGACAGCGAGGATGATGGCGGAGAG^TGAAJGACAMTGGAATC 

CGGCTGGGGGCCCCGCCCATAGCACCCCAAAAGCAGATGCTCTGGCO\ACCACGTCC 

CAGCTGGCTGGCCCGACGAGAGCCTGGCTGGG^ 

AGCCCCACCTGAAGGTGAGACCAAGGTWGCGTGGAGTTACACCGGCAGGCGCAGGG 

(^TCACTGTGGCGAC(X3GCCCTCTGACCCG^ 

GGCCGTGCnAGCAGCCAGCCCCAGGAGCAGCGGAAAGGCATCCTGAAAA 

CACCT ACCCGCCGGCCATTGCCAGAGCAGCCACTGAAGTCCCGGCTGCGAGAGAAGC 

TGGCTGATTGTGAGCAGAGCCCCACATCCTCCCGCACATCCTCCCTTGGCTCTGGCG 

ATGGTGTCCATGCCACTGACTGTGTCAnACCATCAAGACTCCGAGGAGGGAGCCAG 

GCCGTGAGCATCTCMTGGGGTGGCCATGAATGTACGCACAGGGAGTGCCAGGCCAA 

CGGTTCTGACTCAGAGAAACCATGAGGCACAGTCAACCCCACAGACT GCCGGTCAAG 

CCCTCAGACCTTGMGCCTGCCTGGGACTGCTGCG'ATGGGAC^AGCAGGWCC 

TGTGAGGGTCCCTGCCATAGCAGCTGGCTCTACGCAGACCGTCCAGACGGGAAGCCC 

nGACCCTCATAGGAGCTWGGGCCCAGACTCTGACAAAGTGCCAAAGCCACAGA^ 

TCTCCAGAGGGAGACGTGGACTTWTTAAGGCTGGCATAGCTCCGTCCTTAGACTGA 

AGACAGAATCCAAACCATGCTGTACAAfiAGGCCAnGAGCCAGAGCTGGACTTGGTG 

AATCAnGTACCGGGCCCnCMCTGTCCCGCAGGCCTCTCCTTGTGGTACAAGCCC 

ATCACCAC(^CTAGCGGTGCCTCTGCMCGG(>ACCCTGGGTTnAAATTGCT6T 

CTAAAAATGTAMTAGATATAMTCTCTCCCTGGACTTGGGAGAAGATGGGAGCTGT 

GTATG(TrTAWCTGCTR6ACTCTGCAGCCACTGGAGAGCCATGAMTGGCATCT 

ACTCTAnGCCCAAGGAAGCTTGWC^UriTGTACTTGMTCTGGMTGAGTCM 

AGCTGGTCCWTGCCCAGGTAGGGAGCATATQ^TGTGAAGnGACACAGCTTCT 

GCAGCTGCncCGGCCAGGGAGTGTGGGTGTTCCGCTGGCCTTTGGAGCACACGGCA 

CATGTGCTGCGTATGTGTGCCTCAGCTCCATGGAT(XAGGTCAGGGCTCACCTGT6A 

GGAGTGGGC^CATAGCTATGATATGAAACTCrGACCACCCreCCACCCCCACGCCC 

CCCCCCCCCCCCCGCTCCTGATGCAGTGAGGCGACTCTGGAGCCnTTCCCAGTCAGC 



FIGURE 7 CONTINUED 



SUBSTITUTE SHEET (RULE 26) 



WO«>7/072O9 



PCT/GB96/01980 



8/17 



CCAGTGCGCAGCGGGAGGACTTCTGCTCCTGCTCCCATAA6CTCTCAGAGTGCTCAT 
GnCCTGAT TCTGAG GGAGCCCAGAGGTCTTCCMCGCTGTCACGGTCCTTCCTGCC 
AATGTCIACTTmCAGACTCCAGGOT 

AAATCTGACCAATGACATCACTTTGCTTCAAAT6ACCAATTGTGCAAAGAAACAAAG 
CCAGAGXGCCCGTTnCAATGGTTACCACATCCTTT^ 
CTGTGACCAGCCACTGAGAGCCACGGTGCAGCCAAGGCCAAGGGATGGGAGCCTGGA 
GTTACTCAGACACGTTTACTTCAGWTGGACTGTTGTCTGMTCAGGTCCCCAAAGT 
ACATGGGGTGCACAGTCGCTCGGAATGGAGAACCTCAGGCGAGGCGGTCAAAGGCCA 
GGACTTCTCACCAAGCTGTGCTCTCAGAAGTGCACAGGACTGCTAGTGACTGACTGG 
TGAGATGAAAGCCTACAAGACATGGCCTGGGGTCACGCACTGCCAAGAGGCTGACGG 
GAGGCCCAGGT AGCCCAAGGATGGCAAGGATACAGAGTGACCTAGCACAGGGGAGCT 
TCAGTCCCAGGTGGTACAGCACC^TGACMCTCCGCAACCCACGCCACCTCAGAAGG 
T GAAGT TTnGAnCGATCACAACTATTAGCAAA 
GTTTTTCTGACCTAAGACTTTCCTTGACCGTAAGACATGGAGATT^ 
TTTAnACTGTTCAGCACTGGATGGCAACACAGGTGAGATGATGCATCTATAATAAA 
TTAAGATTTTTGGAT 
AAAAAAAA 
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ME2 PROTEIN 1-1798 AMINO ACIDS 

NSGLMEKLKQIEEQTKKAQQELEEQTRR^ 
SNGYFLIDMTGAVTTARSLDRETKDTHVLKV^^ 

IRENLEVGYEVLTIRATD60APSNANMRYRLLE6AGGVFEIDARSGVVRTRAWDREEAAEYQLLVEAND 
QGRNPGPLSASATVH I WEDENDNYPQFSEKRYVVQVPEDVAVNTAVLRVQATDRDQGQNAAI HYS I VSG 
NLKGQFYLHSLSGSLDVINPLDFEAIREYTLRIKAQDGGRPPLINSSGLVSVQVLOVNONAPIFVSSPFQ 
AAVLENVPLGHSVLHIQAVDAOAGENARLQYRLVDTASTIVGGSSVDSENPASAPDFPFQIHNSSGWITV 
CAELDREEVEHYSFGVEAVDHGSPAMSSSASVSITVLDVNDNDPMFTQPVYELRLNEDAAVGSSVLTLRA 
RDRQANSVITYQLTGGNTO^^ 

ANTHRPVFQSSHYTVSVSEDRPVGTS I ATI SATOEDTGENAR ITYVLEDPVPQFR IDPDTGTI YTHTELD 
YEDQAAYTLAITAQDNGIPQKSOTTSLEILILDANDNAPRFLRDFYQGSVFEDAPPSTSVLQVSATDRDS 
GPNGRLLYTFQGGDDGDGDF Y I EPTSGV I RTQRRLDRENVAVYNLWALAVDRGSPNPLSASVGI QVSVLD 
I NDNPP VFEKDELELF VEENSPVGS WAR I RANDPDEGPNAQI I YQI VEGNVPEVFQLDLLSGDLRALVE 
LDFEVRRDYML VVQATSAPL VSRATVH IRLLDQNDNPPELPDFOI LFNNYVTNKSNSFPSGVIGRI PAHD 
PDLSDSLNYTFLQGNELSLLLLDPATGEL0LSRDLDNNRPLEALMEVSVSDGIHSVTALCTLRVT1ITDD 
MLTNS ITVRLENMSQEKFLSPLLSLFVEGVATVLSTTKDO I FVFN I QNDTDVSSN I LNVTFSALLPGGTR 
GRFFPSEDLOEQIYLNRTLLTTISAQRVLPFDDNICLREPCENYMKCVSVLRFDSSAPFISSTTVLFRPI 
HPITGLRCRCPPGFTGDYCETEIDLCYSNPCGANGGCRSREGGYTCECFEDFTGEHCQVNVRSGRCASGV 
CKNGGTCVNLLIGGFHCVCPPGEYEHPYCEVSTRSFPPQSFVTFRGLRQRFHFTVSLAFATQDRNALLLY 
NGRFNEKHDFIALEIVEEQLQLTFSAGETTTTVTPQVPGGVSDGRWHSVLVQYYNKPNIGHLGLPHGPSG 
EKVAVVTVODCDAAVAVHFGSYVGNYSCAAQGTQSGSKKSLDLTGPLLLGGVPNLPEDFPVHSRQFVGCM 
RNLSIOGRIVDMAAFIANNGTRAGCASQRNFCDGTSCQNGGTCVNRWNTYLCECPLRFGGKNCEQAMPH^ 
QRFTGESVVLWSDLDITISVPWYLGLMFRTRKEDGVLMEATAGTSSRLHLQILNSYIRFEVSYGPSDVAS 
MQLSKSRITDGGWHHLLIELRSAKEGKDIKYLAVM^ 

SVRHGFRGCMQGVRMGESSTN I ATLN^DALKVRVKDGCDVEDPCASSPCPPHRPCRDTWDSYSC I CDRG 

YLEKKCVDACLLNPCKHVGSLCALPNTPRGYSCECGPGHYGQYCENKVDLPCPtCGWWGNRCVAPVTVLSA 

KALIPTATRPMASARRITTSPQPRIVAFPmSPRSHSRACmDTG(P\CKPGVIGROCNRCDNPFAEVT 

SLGCEVIYNGCPRAFEAGIWWPQMKFGQPAAVLCPKGSVGNAVRHCSGEKGWLPPELFNCTSG5FVDLKA 

LNEKLNRNETRHDGNRSLRLAKALRNATQGNSIIFGNOVRTAYQLLARILQHESRTO^ 

EDWHTGSALLAPATEASWEQIQRSEAt^LLRHFEAYFSNV^^ 

KLNFTGAQVPRFEDIQEELPRELESSVSFPADTFKPPEKKEGPVVRLTNRRTTPLTAQPEPRAERETSSS 

RRRRHPOEPGQFAVAL W I YRTLGQLLPEHYDPDHRSLRLPNRPV I NTPVVSAMV Y SEGTPLPSSLQRP I 

LVEFSLLETEERSKPVCVFWNHSLDTGGTGGWSAKGCELLSRNRTHVTCQCSHSASCAVLMDISRREHGE 

VLPLKIITYAALSLSLVALLVAFVLLSLVRTLRSNLHSIPQEPIHALFFSQLIFMVGINQTENPFLCTW 

AILLHYVSMGTFAWT1VENLHVYRMLTEVRNIDTGPMAFYHVVGWGIPAIVTGLAVGLDPQGYGNPDFCW 

LSLQDTL IWSFAGPVGTVI I INTVIFVLSAKVSCQRKHHYYERKGVVSMLRTAFLLLLLVTATWLLGLLA 

VNSDTLSFHYLFAAFSCLQGIFVLLFYCVANREVRKHLRAVL^KKLQLDDSATTRATLLTRSLNCNNTY 

SEGSRHAPHRPGQSTASLDSTTRDEGVQKLSVSSGPARGNHGEPDASF I PRNSKKAHGPDSDSDSELSLD 

EHSSSYASSHTSDSEK)GGEAEDICWNPAGGPAHSTPKADAU\NHVPAGiWPOES 

RPRSAWSYTGRRRAITVATGPLTRKVGSWPSQWPCLAASPRSSGKAS* 
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ME2 DNA SEQUENCE 1 - 8210 

GAAnCCGGGCTGATGGAGAAGCT6AAG(^GATTGftGGAGCA6ACTAAGAAGGCTCAGC 

AAGA6CTG6AAGAGCAGACCCGCAGGGCCCTAGAACTT6AGCAGGAACGGAAGCGTGCG 

GGCACTGCGGTWTCGAACTGCGCGCGCACGAC^^ 

TAGCTACCAGATGGAGGCGCTGTTCGATGAGCGCTCT 

CCfmCGGGTGCAGTGACCACAGC^^ 

GTACTCAAAGTTAGTGCTGTG6ACCACGGCTCCCCGAGGCGCTCAGCTGCCACCTACCT 

CACCGTAACTGTC^GTGACACTMCGACWCAGCCCAGTCTTTGAGCAffTCT 

GAGAGCGAATTCGAGAAAACCTGGAGGTGGGCTATGAGGTTCTGACCATCCGTGCCACC 

GACGGGGATGCCC(TTCCMCGCAAACATGCGCTATCGTCTGCTGGAGGGCGCAGGTGG 

TGTCTTTGAGATAGACGCACGATCAGGTGTCGTGCGC^CACGGGCTGTGGTGGA 

AGGAGGCGGCTGAGTACCAGCTGCTGGTGGAGGCCMTGACCAGGGTCGCAATCCAGGC 

CCACTCAGTGCCTCAGCCACCGTCCACATAGTGGTAGAA6ACGAGAATGACAACTACCC 

CWGTTCAGTGAGAAGCGCTATGTGGTTCAAGTCCCAGAAGACGTAGCCGTCAACACGG 

CTGTGCTTCGAGTCCAGGCCACTGACCGGGACCAGGGGCAGMTGCAGCCATACACTAC 

AGCATCGnAGTGGCMCCTGAAGGGTC^^ 

GGATGTTATCMCCCGCTGGACTTCGAAGCCATCCGGGAATACACCCTGCGCATCAAAG 
CCCAAGATGGGGGCCGGCCTCCTCTWTTMnOT 

TTAGATGTGMCGAt^TGCGCCCATCTTTGTGAGCAGCCCCTTTCAGGCTGCCGTGCT 

AGAGAATGTGCCCCTCG^CACTCAGTCCTGCACATCCAAGCGGTGGACGCAGATGCAG 

GGGAGAACGCCAGGCTGCAGT ACCGTCTAGTGGACACAGCCTCCACTATCGTGGGGGGC 

AGMTGTCGACTCTGAGAACCCTGCCTCTGCCCCAGACTTCCCCTTCCAAATCCACAA 

CAGCTCCGGTTGGATTACTGTGTGCGCQGAGCTGGACCGTGAGGAGGTGGAACACTATA 

GCTTTGGAGTA6AAGCAGTGGACCATGGCTCACWGCCATGAGCTCCTCTGCCAGCGTG 

TCWTCACAGTGCTGGATGTAMTGATAACGACCCCATGTTCACGCAGCCTGTGTATGA 

GCTGCGTCT6AATGAGGATGCGGCTGTCGGGAGCAGCGTGCTGACCCTCAGGGCCCGAG 

ACCGTGATGCCMTAGTGTGATCACCTACWGCTGACGGGTGGGAACACCCGCAACCGC 

nCGCACTCAGCAC£CAGAGCGGCGGTGGCCnATCACOT 

CMGCAGGAACGGCAGTATGTGCTGGCTGTrGACCGCGTCCGATGGCACGCGTTCACACA 

CCGCGCAGGTCTTTATCAACGTTACAGATGCCAACA^ 

TCCCACTACACGGTCAGTGTGAGTGAAGAtXGGCCCGTGffiCACCTCCATCGCTACCAT 

C^TGCCACGGATGAGGATACGGGTGAGAACGCCCGCATCACCTATGTGCTAGAGGATC 

CCGTACCACAGncCGCATTGACCCGGACACTGGCACCATTTATACCATGACGGAACTG 

GACTATGAGGACCAGGCTGCCTACACGCTGGCCATCACGGCTCAGGACMTGGK^TTCC 

TCAGAAGTCAGACACTACCTCTCTGGAGATCCTrATCCTCGACGCCMTGAWACGCK 

CC^TTCCTGCGAGAmCTACCAGGGnCTGlTnCGAGGA 

AGTGTCCTCCAGGTCTCTGCTACAGACCGTGACTCAGGCCCTMTGGCCGCCTCCTGTA 

CACTnCCAGGGTGGGGATGATGGAGATGGAGATTTCTACATTGAGCCCACGTCTGGTG 

TGATCCGTACCCAGCGCCGGCTGGAC^^ 

CTCGCTGTGGATCGGGGCAGCCCGAATCCCCTCAGTGCGTCAGTGGGMTTCAGGTGAG 

TGTGTTGGACAnAACGACAACCCCCCAGTGTTTGAGAAGGACGAGCJGGAGCTGTTTG 

TGGAAGAGMCAGCCCTGTGGGnCAGTGCTAGCAAGAATAAGGGCCMCGACCCGM 

GAAGGTCCGAATGCTCAGATCATTTATCAGATCGTGGAGGGCAATGTGCCCGAGGTCTT 

CCAGCTGGACCTACTGAGTGGTGACCTCCGTGCCCT(^TCGAGnGGATTTTGAGGTCC 

GGAGGGACTATATGTTGGTGGTGCAGGCCACGTCTGCTCCTCTGGTAAGCCGGGCCACC 

GTGf^CATCCGTCTCCTGGACCAGAATGACAACCCACCGGAGTTGCCTGACTTCCAGAT 

CCTTnCMCMCTATGTCAC(^TAAATCCMCAGmCC(XAGTGCT 

GCATCCCAGCCCACGACCCTGACCTATCTGACAGCCTCMTTACACCTTTCTGCAAGGC 

AACGAGCTGAR:CTGCTGCTGCTGGATCCCGCCACAGGAGAGTTG(^GCTCA£CCGTO 

TCTGGACAACAACCGGCCACTGGAGGCGCTCATGGAGGTGTCTGTGT CAGATGGTATCC 

ACAGCOTCACCGCTCTCTGCACTCTGCGCGTGACCATCATTACAGATGACATGCTGACC 

AACAGCATCACTGTCCGCCTGGAGAACATGTCGCAGGAGAAGTTCCTGTCCCCGCTGCT 

GTCCCTCTTTGTAGAAGGGGTGGCCACAGTACTGTCCACCACCAAGGATGACATOT 

TCTTCMCATCCAGMCGACACGGACGTCAGCTCCMOT 

GCACTGCTCCCCGGTGGWCCCGTGGCCGGTTCnCCCGTCAGAGGACCTGCAGGAGCA 

GATCTACCTWCCGGACACTGCT(^CCACCATCTCraCCAGCGTGTGaGCCCTTTG 

ATGACAACATCTGCCTGAGGGAGCCCTGCGAGAACTACATGAAGTGCGTGTCCGTGCTT 
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AGGTTTGACAGnCGGCACCClTCAnAGTTCCACCACGGTGCTCTTCCGGCCTATC^ 

TCCCATCACGGGCCTGCGCTGCCGCTGCCCGCCGGGTTTCACCGGGGACT ACTGCGAGA 

CTGAAAnGATCTTTGCTACTCCMTCCGTGCGGGGCC^^ 

GAGGGTGGCTACACTTGTGAGTGCTTCGAGGACTTCACTGGG6AGCATTGCCAGGT6AA 

CGnCGCTCAGGCCGCT6TGCCAGCGGAGTATGCAAAAACGGGGGCACCTGCGTGAACC 

TGCTCAnGGAGGCnCCACTGTGTGTGCCCGCCCGGCSAGTATGAGCATCCCTACTGT 

GAAGTGAGCACCAGGAGCTTCCCACCCCAGTCC7TCGTTACCTTCCGAGGCCT^{^CA 

ACGCnCCACTT(XCGTCTCC(rrGGCG^ 

TCTACMTGGCCGCTTCAATGAGAAGCACGACTTCATCGCCCTGGAGATTGTGGAGGAG 
CAGCTGCAGCTCACGTTCT CGGCAGGTGAGACCACAACCACGGTGACACCGCAGGTTCC 
TGGAGGTGTGAGCGATGGGCGGTGGCATTOTTGCTGGTGCAGTACTACAACAAGCCCA 
AWTTGGCCACCTGGGCCTGCCCCACGGGCCGTCTGGAGAGAAGGTGGCTGTGGTGACT 
GTGGATGACTGTGACGCAGCGGTGGCCGTGCACTTTGGAAGTTACGTGGGGAACTACAG 
CTGCGCTGCCCAGGGCACTD^GAGCGGCTCCAAGAAGTCACTGGATCTGACTGGTCCTC 
TGCTTCTGGGTGGTGTCCCCAACCTGCCAGMGAOT 

GTGGGATGCATGQaAAACCTGTCCATCGATGGCCGGATTGTGGACATGGCTGCGTTTAT 

T(XmWTGGTACCAGGGCAGGCTGTGOTCTCAGAGGAACTTG'GCGATGmCCT 

CATGCCAGAACGGGGGCACCTGTGTGAACAGGTGGAACACGT ACTTATGTGAGTGCCCG 

CTCCGCTTTGGTGGAAAGAACTGTGMCAAGCTATGCMCA^^ 

TGAGAGC^TCGTGTTGTGGAGTGACCTTGACATCACCATCTCTGTGCCTTGGTACCTGG 

GGCTCATdTTCCGGACCCGGAAGGAGGATGGTGTGCTGATGGAAGCCACAGCTGGCACG 

TCnCCAGGCTCCATCTCCAGAnCTCAACAGCTACATCCGCTTTGAffiTCTCCTACGG 

CCCCTCTGACGTGGCATCCATGCAGCTGTCCAAGTCCCGGATAACTGACGGGGGGTGGC 

ATCACCTGCTCATAGAACTGAGGAGTGCCAAGGAGGGCAAGGACATCAAATACCTGGCA 

GTCATGACCTTGGACTATGGGATGGACWGAGCACAGTGCAGATTGGGAATCAGCTTCC 

TGGGnGAAGATGCGGACTAnGTCAT(^GAGGTGTGACCGAGGACAAGGTCTCTGTCC 

GCCATGGTnCCGAGGCTGTATGCAGGGAGTGAGGATGGGAGAGAGCTCCACCAACATT 

GCCACCCTGAACATGAATGACGCCCTCAAGGTCAGGGTGAAGGACGGCTGTGATGTGGA 

GGACCCATGTGCCTCAAGCCCCTGCCCTCCCCATAGACCCTGCCGT6ACACATGGGACA 

GCT ACT CCTGCATCTGTGACAGAGGGTACTTG6AAAAAAAGTGTGTGGATGCGTGTCTC 

CTGAACCCCTGCAAGCACGnGGCAGCCTGTGTGCGCTCCCCAACACTCCTCGAGGCTA 

CTCCTGCGAGTGCGGACCCGGCCACTATGGGCAGTACTGTGAGAACAAAGTCGACCTTC 

CGTGCCCCAAAGGCTGGT(^GGGAACCGGTGTGTGGCCCCTGTCACTGTGCTGTCAGCC 

AAGGCTTTGATCCCGACTGCAACAAGACCAATGGCCAGTGCAAGGAGAATTACTACAAG 

CCCCC^CCAGGATCGTraOTCCCTGTGACTmTCCCCCC^ 

CCTGCGACATG6ACACTGGGCAGTGTGCCTGCAAGCCTGGTGTCATCGGCCGTCA6TGC 

AACCGCTGTGATMTCCTnCGCGGAGGTCACCTCGCTOGCTGTGAAGTGATCTACAA 

TGGGTGTCCCAGAGCATTTGAGGCTGGCATCTGGTGGCCACAGATGAMTnGGGCAGC 

CAGWGCGGTGCTATOCCCAAAGGATCCGTGGGTMCGt^TCCGGCACTGCAGTGGG 

GAGAAGGGCTGGCrTCCCCCAGAGCTCTTCAACTGCACCTCTffiCTCCTTTGTGGACCT 

CAAGGCCTTGAACGAGAAACTGAACCGCAACGAGACAAGAATGGACGGGAACCGGTCCC 

TGCGGCTGGCAAAGGCTCTGAGGMCGCCACGCAGGGGMCAGCACCCTCTTTGGCAAT 

GATGTGCGCACQGCCTACCAGCTTCTGGCCCGCATGTACAGCATGAGAGCCGCCAGCA 

GGGCTTTGACCTGGCAGCCACCCGAGAGGCTAATTTTWTGAGGATGTCGTCCATACA^ 

GCAGCGCCCTCCTGGCCCCAGCTACAGAGGCATCGTGGGAACAGATCCAGCGAAGCGAG 

GCTGGTGCAGCGCAGCTACTGAGGCATTTCGAGGGATACTTCAGCAACGTGGCACGAAA 

TGTGAAGAGGACCTATCTGAGGCCCTTCGTCATCGTCACCGCCAACATGATTCTTGCAG 

TTGACATCTTCGACMGCTGMCnCACGGGTGCCCAGGTGCCAAGGTTTGAGGA(^n 

CAGGAAGAGCTCCCMGGGAGCTGGAGTCCTCCGTGTCCTT CCCAGCTGACACCTTCAA 

GCCACCAGAGAAAAAAGAAGGCCCCGTGGTGAGGCTGACCAACCGGAGGACTACCCCAC 

TCACGGCACMCWGAGCCCAGGGCTGAGAGGGftAACCTCATCCAGCAGACGGAGGAGA 

CACCCCGATGAGCCTGGACAGTTTGCTGTTGCCCTGGTTGTCATTTACCGGACCCTGGG 

TCAGCTGCTGCCTGAACACTATGACCCCGACCATCGCAGCCTCCGACTGCCTAACCGGC 

CTGTCATCAACACCCCCGTGGTGAGTGCTATGGTGTACAGTGAGGGAACCCCACTCCCC 

AGCTCTCTGCAGAGGCCTATCCTGGTGGAGTTCTCCCTGTTGGAGACGGAGGAACGAAG 

CAAACCTGTCTGTGTATTCTGGAACCACTCCCTCGACACTGGTGGGACTGGAGGGTGGT 

CAGCCAAGGGCTGTGAACTTCTGTC^AGGAACCGGACCCACGTCACTTGCCAGTGCA^ 

CAnCGGCCAGCTGCGCGGTGCTCATGGACATTTCCAGACGTGAGCACGGGGAGGTTCT 
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GCCCCTGAAGATCATCACCTATGCtH^CCTGTCCnGTCTTTGGTGGCCCTCCTGGTGG 
CCTTTGTCCnCTCTCGCTCGiTTC^ 

GAACCTATCCA(^TCTGTTCTTCTCCCAGCTWTC"TTCATGGTCGGCATCAACCAGAC 

TGAGMCCCGTTTCTCTGCACAGTGGTCGCCATCCTCCTGCACTACGTCTCCATGGGCA 

CCTTCGCCTGGACCCTTGTGGAGAACrTGCATGTCTACCGCATGCTGACAGAAGTGCGC 

AACATCGACACTGGGCCCATGGCG7TCT ACCACGTGGTGGGCTGGGGCATCCCTGCCAT 

TGTCACAG6ACTGGCTGTTGGCTTGGACCCTCAGGGCTATGGAAACCCTGACTTCTGCT 

GGCTGTCCCTTCAGGATACCCTGATTTGGAGCTTTGCTGGGCCTGTCGGAACGGTTATA 

ATCATCAAWCAGTCATCrrTGTCCTGTCTGCAAAGGTTTCCTGCCAA^^ 

TTATTATGAAAGAAAGGGGGTTGTCTCCATGCTGAGGACG^ 

TCGTCACTGCCACCTGGCTGCTGGGACTGCTGGCGGTCAACAGTGACACTCnAGCTn 

CACTACCTCXTTGCTGCCITtaGCrGOT 

CGTGGCCMWGGGAGGTGCGGAAGCACCTGAGGGCGGTGCTGGCAGGGAAGAAGCTGC 

AGCTGGATGACTCGGCCACWCTCGGGCCACTCTGCW 

MCACCTACAGCGAAGGGTCCAGACATGCTCCGCACCGCCCTGGGCAGTCCACAGCCTC 

TCTGGACAGTACCACCAGGGATGAAGGQGTCCAGAMCTO^TGTGTCCTCTGGCCCAG 

CCCGTGGTMCCATGGAGMCCAGATGCATCCTTCATCCCTAGGAACTCCAAAAAAGCT 

CACGKCCTGACTCTGACTaGACAGTGAGCTGTCCaGGACGAGC^ 

CGCCTCnCACACACATCQaACAGCGAGGATGATGGCGGAGAGGCTGAAGACAAATGGA 

ATCCGKTTGGGGGCCCCGCCCATAGCACCCCAAAAGCAGATGCTCTGGCCMC(^CGTC 

CCAGCTGGCTGGCCCGACGAGAGCCTGGCTGGGAGTGACAGTGAGGAGTTGGACACTGA 

GCCCCACCTGAAGGTGAGACCAAGGTCAGCGTGGAGTTACACCGGCAGGCGCAGGGCAA 

TtXTGTGGCGACCGGCCCTCTGACCCGGAAAGTGGGGTCCTGGCCAAGCCAGTGGCCG 

TGCTTAGCAGCCAGCCCCAGGAGCAGCGGAAAGGCATCCTGAAAAACAAAGTCACCTAC 

CCGCCGGCCATTGCCAGAGCAGCCACTGAAGTCCCGGCTGCGAGAGAAGCTGGCTGATT 

GTGAGCAGAGCCCCACATCCTCCCGCACATCCTCCCnGGaCT^ 

GtXACTGACTGTGTCATTACCATWAGACTCCGAGGAGGGAGCCAGGCCGTGAGCATCT 

CMTGGGGTGGCCATGAATGTACGCAWGGGAGTGCCAGGCCAACGGTTCTGACTC^ 

GAAACCATGAGGCACAGTCAACCCWCAGACTGCCGGTCAAGCCCTCAGACCTTC^ 

CTGCCTGGGACTGCTGCCTATGGGACAAGQUSGM 

GCAGCTGGCTCTACGCAGACCGTCCAGAC^GAAGCCCTTGACCCTCATAGGAGCTC^ 

GGCCCAGACTCTGACAAAGTGCCAAAGCCACAGATGTCTCCAGAGGGAGACGTGGAOT 

CAnAAGGCTGGCATAGaCCGTCCnAGACTGMGACAGMTCCAAACCAT^^ 

AAGAGGCCATTGAGCCAGAGCTGGACTTGGTGAATCATTGT ACCGGGCCCTTCAACTGT 

CCCGCAGGCCTCT COTGT GgTACAAGCCCATCACCACCAGCCTAGCGGTGC 

ACGGCAACCCTGGGTTnAAATTGCTGTCTAAAAATGTAMTAGATATAAATCTCTCCC 

TGGACnGGGAGAAGATfflSGAGCTG^ATGCTTTACACTGCTTTGACTCT 

TGGAGAGCCATGAMTGGCATCT(XTCTAnGCCCMGGAAGCnGO\CAGTTGT ACT 

TGAATCTGGAATGAGTCAACTCAGCTOTCCCAGTGCCCAGGTAGGGAGCATATGGGCT 

GTGAAGnGAWCAGCTTCTGCAGCTGCnCCGGCCAGGGAGTGTGGGTGT^ 

CCTTTGGAGCACACGGCACATGTGCrSXTATGTGTGCCTCAGCTCCATGGAT^ 

CAGGGCTaCCTGTGAGGAGTOGGCGCaTAGCTATGATATGAMCT^^ 

CCACaC(X(£CCCCCCCC^^ 

nCCCAGTCAGCCCAGTGCGCAGCGGGAGGACnCTGCTCCTGCTCCWTAAGCTCTCA 

GAGTGCTCATGTTCCTGATTCTGAGGGAGCCCAGAGG^ 

TTCaGCCMTOTGTACriTn^ 

CCCCTGAAATCTGACCAATGACATCACTnGCTTD\W 

AAAGCCAGAGTGCCCGTTTTCMTGGTTACCACATTCTITTTGGMTCTC^ 

ACCWGACCAGCCACTGAGAGCCACGCTGCAGCCAAGK 

GnACTCAGACACGTTTACTTCAGCATGGACTGTITGTCTGAATCAGGTCCCCAA^ 

ATGGGGTGCACAGTCGCTCGGMTGGAGMCCTCAGGCGAGGCGGTCAAAGGCCAGGAC 

TTCTCACCAAGCTGTGCTCTCAGAAGTGCACAGGACT^ 

GAAAGCCTAWAGACATGGCCTGGGGTCACGWCTGCCAAGAGGCTGACGGGAGGrc 

GGTAGCCCAAGGATGGCAAGGATACAGAGTGACCTAGCACAGGGGAGCn 

GTGGTACAGCACCGTGACMCTCCGCAACCXACGCCACCTCAGAAGGTGAAGTTTTTG^ 

nCGATCA WCTATTAGCAAAACA^ACCCrGTCAGTTTT AAACTGTTTTTCTGACCTA 

AGACTTTCCTTGACCGTAAGACATGGAGATTTTAACAGGTGTATTTA 
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ACTGGATGGCMCACAGGTGAGATBATGWTCTATMTAMTTAAGATTTTTGGATT^ 
TAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIGURE 9 CONTINUED 



SUB5TTTUTE SHEET (RULE 26) 



WO 97/07209 PCT/GB96/01980 

U/17 

FIG. 10 

5 10 15 20 25 30 35 40 45 50 

* * * * , 

N5GLM EKLKQ IEEQT KKAQQ ELEEQ TRRAL ELEQE RKRAG TAVTE LRAHD 

55 60 65 70 75 80 85 90 95 100 
***** 

C01 PDBGD AGRLS YQMEA LFDER SNGYF LIDAA TGAVT TARSL DRETK DTHVL 
105 110 115 120 125 130 135 140 145 150 

* -r 

KVSAV DHGSP RRSAA TYLTV TVSDT NDHSP VFEQS EYRER IRENL EVGYE 
155 160 165 170 175 180 185 190 195 200 
CD2 VLTIR ATDGD APSNA NMRYR LLEGA GGVFE IDARS GWRT RAWD REEAA 

205 210 215 220 225 230 235 240 245 250 

* * * * * 
EYQLL VEAND QGRNP GPLSA SATVH IWED ENDNY PQFSE KRYW QVPED 

255 260 265 270 275 280 285 290 295 300 
***** 

CD3 VAVITT AVLRV QATDR DQGQN AAIKY SIVSG NLKGQ FVLHS LSGSL DVTNP 
305 310 315 320 325 330 335 340 345 350 
LOFEA IREYT LRIKA QDGGR PPLIN SSGLV SVQVL DVNEN APIFV SSPFQ 

355 360 365 370 375 380 385 390 395 400 

* * * * • 

AAVLE NVPLG HSVLH IQAVD ADAGE NARLQ YRLVD TASTI VGGSS VDSEN 

CDt 405 410 415 420 425 430 435 440 445 450 
***** 
PASAP DFPFQ IHNSS GWITV CAELD REEVE HYSFG VEAVD HGSPA MSSSA 

455 460 465 470 475 480 485 490 495 500 

* -j- * * * * 

svsrr vudvn endpm ftqpv yelrl nedaa vgssv ltlra rdrda nsvit 

505 510 515 520 525 530 535 540 545 550 
***** 
CD5 YQLTG GNTRN RFALS SQSGG GLITL ALPLD YKQER QYVLA VTASD GTRSH 

555 560 565 570 575 580 585 590 595 600 

* -[-• * * * 
TAQVF INVTD ANTHR PVFQS SHYTV SVSED RPVGT SIATI SATDE DTGEN 

605 610 615 620 625 630 635 640 645 650 

* 

CD 6 ARTTY VLEDP VPQFR IDPDT GTCYT SfTELD YEDQA AYTLA ITAQD NGIPQ 

655 660 665 670 675 680 685 690 695 700 

KSDTT 5LEIL ILDAN DNAPR FLRDF YQGSV FEDAP PSTSV LQVSA TDRDS 

705 710 715 720 725 730 735 740 745 750 
***** 
CD7 GPN3R U-YTF QGGDD GDGBF YIEPT SGVTR TQRRL DREW AVYNL WALAV 

755 760 765 770 775 780 785 790 795 800 

DRGSP NPLSA SVGIQ VSVLD INDNP PVFEK DELEL FVEEM SPVGS WARI 



SUBSTITUTE SHEET (RULE 26) 



WO 97/07209 PCT/GB96/01980 
^ 15/17 

FIG. 10 CONTINUED 

805 810 815 820 825 830 835 840 845 850 

* « • * * 

CQB RAND? DEGPN AQIIY QIVEG NVPEV FQLDL LSGDL RALVE LDFEV RRDYM 
855 860 B65 870 875 880 885 890 895 900 
LWQA TSAPL V5RAT VHXRL LDQND NPPEL PDFQI LFNNY VTNKS NSFPS 

905 910 915 920 925 930 935 940 945 950 

* * * # * 

C09 GJIGSL IPAKD PDLSD SLNYT FLQGN ELSIL LLDPA TCELQ LSRDL ENNRP 
955 960 965 970 975 980 985 990 995 1000 
LEALM EVSV3 DSIHS VTALC TLKVT IITDD MLTNS ITVRL ENMSQ EKFLS 
1005 1010 1015 1020 1025 1030 1035 1040 1045 1050 

* * 9 * * 

PLLSL FVEGV ATVLS TTKDD IFVFN IQNDT DVS5N ILNVT FSALL PGGTR 
1055 1060 1065 1070 1075 1080 10B5 1090 1095 1100 

GRIT? SSDLQ EQIYL NRTLL TTISA QKVLP FDDNI CLRE? CENYM KCVSV 
1105 1110 1115 1120 1125 1130 113 5 1140 1145 1150 

r * Div "EGF -r 

LRFDS SAPTI SSTTV LFRPI HPCTG LRCEC PPGFT GDYCE TEIDL ^£YSNP E6F1 

1155 1160 1165 1170 1175 1180 1185 1190 1195 1200 
— I * j— 

CGANG GCRSR EGGYT £ETFE BFTGE HCQVN VRSGR ^SGV CKNGG TCVML QJf 2 

12C5 1210 1215 1220 1225 1230 1235 1240 1245 1250 
LIGGF BgUTCP FGEYE HPYtjE VSTES FPFQS FVTFK GLRQR FHFTV SLAFA 

1255 1260 1265 1270 1275 1280 1285 1290 1295 1300 
TQDRN ALLLY NGRTN EKHDF IALEI VEEQL QLTFS AGETT TTVTP QVPGG 

1305 1310 1315 1320 1325 1330 1335 1340 1345 1350 
VSDGR WKSVL VQYYN KPNIG HLGLP HGPSG EKVAV VTVDD CDAAV AVHFG 

1355 1360 1365 1370 1375 1380 1385 1390 1395 1400 
SYVGN YSCAA QGTQS GSKKS LDLTG PLLLG GVPML PEDFP VHS^JfVGCK 

1405 1410 1415 1420 1425 1430 1435 1440 1445 1450 
RNLSI DGRIV DMAAF IANNG TRAGC ASQRN FCDGT SCQNG GTCVN RWKTY 

1455 1460 1465 1470 1475 1480 1485 1490 1495 1500 EGF3 
L£ZT? LBFGG KlSx3 AMPKP QRFTG ESWL WSDLD ITI5V PWYLG LMFPTT 
1505 1510 1515 1520 1525 1530 1535 1540 1545 1550 

RKEDG VLMEA TAGTS SRLHL QHJNS YTRFZ VSYGP SDVAS MQLSK SRITD 
1555 1S60 1565 1S70 1S75 1S80 1585 1590 1595 1600 

* » » •> • 

GGWHH T.MKT, RSAKE GKDIX YLAVM TLDYG MDQST VQXGN QLPGL BMRXI 
1605 1610 1615 1620 1625 1630 1635 1640 1645 1650 
VIGGV TEDKV SVRHG FRGCM QGVRM GESST NIATL NMNDA LKVRV KDGCD 
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X655 1660 1665 1670 1675 1680 1685 1690 1695 1700 

EGFi r - ^ESV ~i r 

VEDP£ ASSPC PPHRP CRDTW OSYSg ICDRG YLEKK LLNPC KHVCS 



E0F5 



1705 1710 1715 1720 1725 1730 1735 1740 1745 1750 
LCALP NTPRG YK)e£ GPGHY GQYCE NKVDL PCPKG WWGNR CVAPV TVLSA 
1755 1760 1765 1770 1775 1780 1785 1790 1795 1800 

• r • DIVERGED EBF 

KALIP TATRP MASAR RITTS PQPRI VAFPV TVSPR SHSRA CEKDT GQCAC 

1805 1810 1815 1820 L825 1830 1835 1840 1845 1850 

• KPGVI GRQOJ RCDNP FAEVT SLGCE VIYSC CPRAF EfiGIW WPQMK FGQPA 

1355 1850 1865 1370 1875 1BB0 1885 1890 1895 1900 

AVLC? KGSVG NM7RH CSGEK GWLPP ELFEJC "tSGSF VDLKA LNEKL NRNET 

_J 

1905 1910 1915 1920 1925 1930 1935 1940 1945 1950 
RKDGN RSLFX AKALR NATQG NSTLF GNDVR TAYQL LARIL QHESR QQGFD 

1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 

LAATR EANFH EDWH TGSAL LAFAT EASWE QIQRS EAGAA QLLRH FEAYF 

2005 2010 2015 2020 2025 2030 2035 2040 2045 2050 
• * * * • 

SNVAR NVKRT YLRPF VTVTA NtCLA VDTFD KLNFT GRQVP RFEDI QEELP 
2055 2060 2065 2070 2075 2080 2085 2090 2095 2100 

RSLES SVSFP ADTFK PPEKK EGPW HLTNR Kl'l'PL TAQPE PRAER ETSSS 
2105 2110 2115 2120 2125 2130 2135 2140 2145 2150 

RRHBH PDEPG QFAVA LWIY RTLGQ LLPEH YDPDH RSLRL PNRFV INTPV 
2155 2160 2165 2170 2175 2180 2185 2190 2195 2200 

VSAHV YSECT PLPSS LQRPI LVEFS LLETE ERSKP VCVFW NHSLD TGGTG 

2205 2210 2215 2220 2225 2230 2235 2240 2245 2250 

* ~T~ 

GWSAK GCSLL SHNPT HVTCQ CSHSA SCAVL KDISR REHGE VLPLK IITYA 

. TM1 -L- TM1 

2255 2260 2263 2270 2275 2280 2285 2290 2295 2300 

# r TM-2 -i • TUO 

ALSLS LVALL VftFVL LSLVR TLRSN LHSI? ^EPIH ALFFS VGINQ |M2 

2305 2310 2315 2320 2325 2330 2335 2340 2345 2350 

TM3 i * r* ' * 

TENPF LCTW AILLH YVSM3 TFAWT LVEUL HVYPM LTEVR NUTTG.PMAT/ JM3 
2355 2360 2365 2370 2375 2380 2385 2390 2395 2400 

-r • - r-" TM5- TUC 

TM/» KWGK GIrAI VT GLA VGLDP QGYGN PDFCW L5LQD TLIWS FAGPV GTVXI fM5 
2405 2410 2415 2420 242S 2430 2435 2440 2445 2450 

— i * r~ " TMB " Tt jf- 

INTVI FVLSA KVSCQ RKHHV iZRKG WSML BJCAFL LLLLV TATKL LGLLA [M5 
2465 2460 2455 247C 2475 2480 2485 2490 2495 2500 

* i- • TM7 • -i 

VNSDT LSFHY LFAAF SCLQG IrVLL FYCVA NREVR KHLRA VLAGK KLQI-D 
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25C5 2510 2515 2520 2525 2530 2535 2540 2545 2550 
DSAT7 3ATLL TRSLN CMNTY SEGSR HAPHR PGQST ASLDS TTRDE GVQKL 

2555 2560 2565 2570 2575 253C 2585 2590 2595 2600 

S7SSG PAF.GN KGEPD ASFI? RNSKK AHGPD SDSDS ELSLD EHSSS YASSH 

2605 2510 26L5 2620 2625 2630 2635 2640 2645 2650 
» * * * * 

TSDSE DDGGE AEDKW MPAGG PAHST PKADA. LANHV PAGWP OESLA GSDSE 
2655 2660 2665 2670 2675 2680 2685 2690 2695 2700 

* m • 9 * 

ELDTE PHLKV RPRSA WSYTG RERAI TVATG PLTRK VGSWP SQWPC LAASP 

2705 
RSSCK AS 
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